The Dataiku Blog

SQL, R, and Python: Why Data Wrangling in ONLY Code is Inefficient

So everyone knows the oh-so-popular statement that a data scientist spends 50 to 80% of his time cleaning and preparing his data before he even starts looking for insights in it. I mean everyone’s...

Opinion, Data Preparation, data science | February 24, 2016 | Alivia Smith

Advice from John Kelly: Preparing for Data Science Adoption

When building or growing data science teams, companies often face a noisy world. As I was trying to identify the group dynamic in terms of pains and challenges, I came across an article from John...

Interview, Technology, business | February 22, 2016 | Caroline Martre

15min to Understand How Predictive Analytics Could Save Healthcare

In this podcast, Intrepid Editor-in-Chief, Joe Lavelle interviews Eric Kramer, data scientist at Dataiku. In this 15min episode, Eric explains how predictive analytics could help healthcare...

healthcare, data science, predictive analytics | February 21, 2016 | Pauline Brown

Modern Data Science: Monogamy or Ménage à Trois?

What I call monogamy in a technological environment is to remain faithful to only one development language. So yes, I know you’re thinking coding and being married (or in a relationship) are two...

organization, Technology, business | February 12, 2016 | Lara Khanafer

Building a Data Pipeline to Clean Dirty Data

A data pipeline is a series of steps that your data moves through. The output of one step in the process becomes the input of the next. Data, typically raw data, goes in one side, goes through a...

Data Preparation, Technology, Data analysis | February 10, 2016 | Robert Kelley

Data Analysis Reveals the True Nature of Peer-Reviewed Journals

Pierre Bourdieu first made a very strong impression on me when I was just a college student. Not only did he have a funny-sounding name, at least to a French ear, but he was one of the most...

Data Visualization, data science, Data analysis | February 10, 2016 | Leo Dreyfus-Schmidt

Sky Diving… For The Second Time

Fifteen years ago, as part of the HEC Entrepreneurs Master’s integration program, I had my first skydiving experience. I remember a mix of fear and excitement, climaxing in total fear when, at...

Corporate, announcement | January 28, 2016 | Carole Offredo

Telling Stories With Data Visualization by Matt Daniels from Polygraph

Before I had ever even heard about Dataiku and started really working around data, I remember reading this awesome article around March 2015 on the ranking of hip hop artists based on their...

Interview, Data Visualization, Technology | January 27, 2016 | Alivia Smith

Merging Data Sources to Investigate Student Loan Debt

Many millennials in the United States are burdened with high student loan debt. Outstanding student loan debt in the United States was at a massive $1.19 trillion as of June 2015. 

Data Visualization, data science, machine learning | January 25, 2016 | Jed Dougherty

The Consequences Of The Data Revolution On Infrastructure Management

Today, a single innovation can put a market upside down in a couple of days. Most innovative products in the decade are data-driven and we are beginning to see the benefits of creating ‘data...

Product, Technology, business | January 19, 2016 | Joel Belafa

A User Marketer Asks: Why Is Nobody Talking About User Marketing?

Everyone in the startup universe knows that everything they do should be centered on the user. So why isn’t "user marketing" a thing yet? Is it because no one is doing it?

Users, Opinion, Corporate | January 19, 2016 | Alivia Smith

Put Your Data to Work in Health Care

Quantified self, population health, patient engagement, telehealth, interoperability… the health care IT industry is buzzing with plenty of opportunities but is missing a few basic standards to...

health care, business | January 14, 2016 | Romain Doutriaux

How Can the Healthcare Industry Optimize Scheduling with Data?

I have a definite phobia about doctor's waiting rooms. This phobia came about fairly recently, after contracting chickenpox, a disease I attributed to spending some time in a pediatrician's...

healthcare, business, predictive analytics | January 08, 2016 | Florian Douetteau

A Recap of our Student Data Science Challenge with Datalyo in Lyon

On November 20th, we organized a Data Science Challenge in Lyon with our partner Datalyo, a data-driven consulting firm. The goal was to bring together 50 engineering and business school students...

Partnership, Corporate, Events | December 31, 2015 | Vincent de Stoecklin

Christmas Data and Topic Modeling

Thanks to our amazing team of data scientists, after a bit of cleaning and hours of modeling based on historical data spreading over 2015 years, we have determined that Christmas 2015 would happen...

Data Preparation, data science, Technology | December 25, 2015 | Leo Dreyfus-Schmidt

How Data Analytics Can Improve Healthcare Daily Practices [Part 3]

This is the third and last part of my interview with Doctor Martin Pusic. In part one of the talk, he explained how data analytics can improve clinicians’ daily practices. In part two, he defined...

Interview, data science, Technology | December 22, 2015 | Romain Doutriaux

Health Care Analytics: Data-Driven Scheduling to Reduce No-Shows

The health care industry suffers from a data problem. There is an abundance of raw data in health care, but no one really knows what to do with it. The good news is that all of this data can be...
health care, business | December 16, 2015 | Romain Doutriaux

How Data Analytics Can Improve Healthcare Daily Practices [Part 2]

This is the second part of my interview with Doctor Martin Pusic. In part one of the talk, he explained how data analytics can improve clinicians’ daily practices. Today, he defines how a proper...

Interview, healthcare, data science | December 14, 2015 | Romain Doutriaux

Making Deploying a Predictive Application Into Production Easy [Video]

This is a video of our data scientist Matthieu Scordia demonstrating how to build, deploy, and run a predictive application in under 15 minutes. Challenge accepted! 

video, machine learning, tutorial | December 08, 2015 | Thomas Thus

A Kaggle Data Science Competition Made Easy

At Dataiku, every new member of the team, either marketer or superstar data scientist, has to try DSS out with the Titanic Kaggle Competition. It's such a milestone in the company that our first...

data science, machine learning, tutorial | December 08, 2015 | Alivia Smith
Page 13 of 19