Data Science Weekly Newsletter - Issue 10

Issue #10

January 30 2014

Editor Picks

 
  • Using Data Mining to Predict the Winter Olympics Medal Counts in Sochi

    Which nation will bring home the most medals at the upcoming Winter Olympics in Sochi, Russia? Will any nation from Africa, South America, or the Middle East finally break through and win a medal? Why do some nations win a bundle of medals while others win only a few? Can data mining give us the answers to these questions?...
  • Deep Learning: Teaching Computers To Think Like People

    It's the holy grail of computing: teaching computers to think the way humans do. This Tech Tuesday in-depth interview explores how computers learn and where the next breakthroughs will be, with 3 expert panelists - Geoffrey Hinton, Philip Resnik and Richard Socher...
  • Why Data Science Matters to Foursquare

    Since its most recent update, Foursquare users now spend 30% more time with the check-in app. And it wouldn't have been possible without the input of data scientist Blake Shaw...
 
 

Data Science Articles & Videos

 
  • Cancer: A Computational Disease that AI Can Cure
    Inspired by my experiences as an AI researcher, entrepreneur and cancer survivor... Cancer Commons [is creating], a "rapid learning" community of patients, physicians and researchers. Our goal is to cure cancer by collecting the genomic and response data from thousands of adaptively-planned individual treatment experiments, integrating the resulting sparse fragments of evidence to infer the true causal mechanisms of tumors and drugs, and generalizing the resulting knowledge so it can be applied to new cases...
  • Backtesting a Forecasting Strategy for the S&P500 in Python with Pandas
    Recently on QuantStart we've discussed machine learning, forecasting, backtesting design and backtesting implementation. We are now going to combine all of these previous tools to backtest a financial forecasting algorithm for the S&P500 US stock market index by trading on the SPY ETF...
  • Data Science @ Yummly
    In honor of Data Innovation Day 2014, in this post we’ll highlight how we collect, understand, and leverage food and recipe data at Yummly...
  • How New York’s Fire Department Uses Data Mining
    New York City has about a million buildings, and each year 3,000 of them erupt in a major fire. Can officials predict which ones will go up in flames? The New York City Fire Department thinks it can use data mining to do that. Analysts at the department say that some buildings are linked to characteristics that make them more likely to have a fire than others...
  • The Role of Algorithms in Data Visualization
    It’s somewhat surprising to me to notice how little we discuss about the more technical side of data visualization. I use to say that visualization is something that “happens in your head” to emphasize the role of perception and cognition and to explain why it is so hard to evaluate visualization. Yet, visualization happens a lot in the computer also, and what happens there can be extremely fascinating too...
  • Bayesian Bandits - Optimizing Click Throughs with Statistics
    The story is already written, now a title needs to be selected. The clever reporter who wrote the story has come up with two potential titles - “Murder victim found in adult entertainment venue” and “Headless Body found in Topless Bar”. Once upon a time, deciding which title to run was a matter for a news editor to decide. Those days are now over - geeks now rule the earth. Title selection is now primarily an algorithmic problem, not an editorial one...
  • Powering Interactive Data Analysis at Pinterest by Amazon Redshift
    In the last six month, we have set up Amazon Redshift to power our interactive data analysis at Pinterest. It has tremendously improved the speed of analyzing our data. This presentation provides details on the infrastructure, its integration process and best practices...
 
 

Jobs

 
  • Data Scientist at Fullscreen in Culver City, CA

    We are looking for a Data Scientist to join our Engineering team. If you are a hands-on data scientist who knows how to operationalize the extraction of data, iterate on development and testing of algorithms this may be the role for you. You will play a crucial role in helping build the next generation of algorithms, models and decision support systems at Fullscreen....
 
 

Training & Resources

 
  • An Idiot Learns Bayesian Analysis: Part 1

    I've done a dreadful job of reading The Theory That Would Not Die, but several weeks ago I somehow managed to read the appendix. Here the author gives a short explanation of Bayes' theorem using statistics related to breast cancer and mammogram results. It's profound in its simplicity and- for an idiot like me- a powerful gateway drug. What does it all mean, how can we extend it and what does it have to do with an underlying philosophy of Bayesian analysis (if such a thing exists)?...
  • Statistical Data Mining Tutorials

    The following links point to a set of tutorials on many aspects of statistical data mining, including the foundations of probability, the foundations of statistical data analysis, and most of the classic machine learning and data mining algorithms...
  • How to Implement a Machine Learning Algorithm
    Implementing a machine learning algorithm in code can teach you a lot about the algorithm and how it works. In this post you will learn how to be effective at implementing machine learning algorithms and how to maximize your learning from these projects...
  • 5 important Machine Learning Papers
    "If I were to read 5 papers from the last couple of years that capture the interesting/important stuff happening in ML, what would they be?" So below's my answer... They are chosen partly as reflections of where I think the field is going, and partly as reflections of where I think the field should be going. And of course the list is totally subjective and missing great papers by some of my favourite researchers: it's a personal list...
 
 
P.S. Did you enjoy the newsletter? Do you have friends/colleagues who might like it too? If so, please forward it along - we would love to have them onboard :)
 
Sign up to receive the Data Science Weekly Newsletter every Thursday

Easy to unsubscribe. No spam — we keep your email safe and do not share it.