December 27, 2018

Editor's Picks

  • The Netflix Data War
    A recent article in the Wall Street Journal, “At Netflix, Who Wins When It’s Hollywood vs. the Algorithm?” by Shalini Ramachandran and Joe Flint, details some of the internal debates within Netflix between the Los Angeles-based content team, which is in charge of developing and marketing new content for the streaming service, and the data team. I thought it was a useful place to launch a discussion about the activity of a data team and how it interfaces with other aspects of a company...

Data Science Articles & Videos

  • Trends in Deep Learning with Jeremy Howard
    In this episode of our AI Rewind series, we’re bringing back one of your favorite guests of the year, Jeremy Howard, founder and researcher at fast.ai. Jeremy joins us to discuss trends in Deep Learning in 2018 and beyond. We cover many of the papers, tools and techniques that have contributed to making deep learning more accessible than ever to so many developers and data scientists...
  • The year in AI/ML advances: 2018 roundup
    It has become a sort of tradition for me to try to summarize ML advances at this time of the year. As always, this summary will necessarily be biased by my own interests and focus, but I have tried to keep it as broad as possible...
  • Photo Wake-Up: 3D Character Animation from a Single Photo
    We present a method and application for animating a human subject from a single photo. E.g., the character can walk out, run, sit, or jump in 3D. The key contributions of this paper are: 1) an application of viewing and animating humans in single photos in 3D, 2) a novel 2D warping method to deform a posable template body model to fit the person's complex silhouette to create an animatable mesh, and 3) a method for handling partial self occlusions...
  • Neuroevolution-Bots
    Neuroevolution-Bots is a personal project that demonstrates neuroevolution in a browser environment using TensorFlow.js, Neataptic (for neural nets) and HTML5 Canvas (for graphics). I tried to create a scaled-down 2D version of the popular Gym’s Humanoid-v2 environment using Planck.js, a JavaScript rewrite of Box2D...
  • Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
    We introduce an architecture to learn joint multilingual sentence representations for 93 languages, belonging to more than 30 different language families and written in 28 different scripts. Our system uses a single BiLSTM encoder with a shared BPE vocabulary for all languages, which is coupled with an auxiliary decoder and trained on publicly available parallel corpora. This enables us to learn a classifier on top of the resulting sentence embeddings using English annotated data only, and transfer it to any of the 93 languages without any modification...


Training & Resources

  • Deep Graph Infomax
    General approach for learning node representations within graph-structured data in an unsupervised manner based upon mutual information, rather than random walks...
  • Introducing Pandas-Sets: Set-oriented Operations in Pandas
    I frequently find myself storing standard Python set objects in DataFrame columns. This usually happens when I have some kind of tags or labels column for each observation. It can also be the output of a groupby operation where the end result needs to be a list-like (or set-like) object before it's aggregated. Using set operations (union, intersection, etc.) can come in handy in such cases...
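
The set-per-observation idea in the Pandas-Sets blurb can be sketched in plain Python. This is a minimal illustration only: it does not use pandas or the Pandas-Sets API, and the `rows` data and all variable names are hypothetical.

```python
from collections import defaultdict

# Hypothetical (item, tag) observations, as they might look before a groupby.
rows = [
    ("post_a", "python"),
    ("post_a", "pandas"),
    ("post_b", "pandas"),
    ("post_b", "sets"),
]

# Collect the tags for each item into a set, similar in spirit to a
# groupby whose result is a set-like object per group.
tags_by_item = defaultdict(set)
for item, tag in rows:
    tags_by_item[item].add(tag)

# Set operations across all groups: every tag seen, and tags common to all items.
all_tags = set.union(*tags_by_item.values())
shared_tags = set.intersection(*tags_by_item.values())
```

Here `all_tags` collects every tag that appears on any item, while `shared_tags` keeps only the tags present on every item.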


