The Dataiku Blog

Combining Human Knowledge with Machine Learning for Robust Data Flows

Even if you’re working with 100% machine-created data, more than likely you’re performing some amount of manual inspection on your data at different points in the data analysis process, and the...

data science, Technology, machine learning | April 14, 2016 | Robert Dempsey

Top 12 Secret Shortcuts of Dataiku DSS

Here is a post for power users, people who want to make the most out of Dataiku DSS. The following are undocumented keyboard shortcuts subject to change. But some of them are still darn useful, so...

data science, Technology, data | April 11, 2016 | Jean-Baptiste Rouquier

Investigating Geographic Response Rates of 311 Calls

Government data can often reveal suprising insights about the way communities are served. Here, I used Dataiku DSS to analyze data about 311 calls in New York City. My goal was to determine...

data science, Technology, tutorial | April 04, 2016 | Jed Dougherty

Three Paths to Updating Your Data Technology

Data science departments often use older technologies that were in place when they launched. But the new data scientist generation is using newer technologies such as R, Python, etc. How can you...

data science, Technology, collaboration | March 30, 2016 | Romain

Automation Scenarios: Another Step Towards Successful Model Deployment

I - probably like many before me - like to think of data science (and more generally of big data) as a process of a final outcome: deployment into production. The thing is, putting a model to...

Production, Technology, Data Engineering | March 29, 2016 | Margot

Advice From John Kelly: Preparing for Data Science Adoption (Part II)

This is the second part of my interview with John Kelly where he explains the most common challenges in terms of organization and why big data investment has not yet impacted companies at scale. ...

Interview, Technology, business | February 25, 2016 | Caroline Martre

Advice from John Kelly: Preparing for Data Science Adoption

When building or growing data science teams, companies often face a noisy world. As I was trying to identify the group dynamic in terms of pains and challenges, I came across an article from John...

Interview, Technology, business | February 22, 2016 | Caroline Martre

Modern Data Science: Monogamy or Ménage à Trois?

What I call monogamy in a technological environment is to remain faithful to only one development language. So yes, I know you’re thinking coding and being married (or in a relationship) are two...

organization, Technology, business | February 12, 2016 | Lara Khanafer

Building a Data Pipeline to Clean Dirty Data

A data pipeline is a series of steps that your data moves through. The output of one step in the process becomes the input of the next. Data, typically raw data, goes in one side, goes through a...

Data Preparation, Technology, Data analysis | February 10, 2016 | Robert Kelley

Telling Stories With Data Visualization by Matt Daniels from Polygraph

Before I had ever even heard about Dataiku and started really working around data, I remember reading this awesome article around March 2015 on the ranking of hip hop artists based on their...

Interview, Data Visualization, Technology | January 27, 2016 | Alivia Smith

The Consequences Of The Data Revolution On Infrastructure Management

Today, a single innovation can put a market upside down in a couple of days. Most innovative products in the decade are data-driven and we are beginning to see the benefits of creating ‘data...

Product, Technology, business | January 19, 2016 | Joel Belafa

Christmas Data and Topic Modeling

Thanks to our amazing team of data scientists, after a bit of cleaning and hours of modeling based on historical data spreading over 2015 years, we have determined that Christmas 2015 would happen...

Data Preparation, data science, Technology | December 25, 2015 | Leo Dreyfus-Schmidt

How Data Analytics Can Improve Healthcare Daily Practices [Part 3]

This is the third and last part of my interview with Doctor Martin Pusic. In part one of the talk, he explained how data analytics can improve clinicians’ daily practices. In part two, he defined...

Interview, data science, Technology | December 22, 2015 | Romain Doutriaux

Interview With Helena Edelson: Spark and Stream Processing

In the big data engineering world, Spark and stream processing are the words on everybody's lips these days. Here are a few words of wisdom by somebody why actually works with Spark everyday, the...

Interview, spark, Technology | November 22, 2015 | Alivia Smith

The Long Journey Across Technoslavia

   

My grandmother was born in a country called Yugoslavia. As you may know, Yugoslavia was one country with seven borders, six republics, five nationalities, four languages, three religions, and...

Opinion, Corporate, Technology | November 18, 2015 | Florian Douetteau

Dataiku Data Science Studio and MongoDB

There’s a new sheriff in town and his name is Mongo! MongoDB, that is. And this guy is doing things differently… no tables and no relational structure here. MongoDB is all about dynamic schemas,...

Technology | November 16, 2015 | Brian

World Geopolitics through Hillary Clinton's emails

Today's subject is natural language processing with a dataset of choice: Hillary Clinton's emails. These emails are a hot topic in the current Democrat primaries. In this blog post, we will use...

data science, Technology, machine learning | October 25, 2015 | Hanna Julienne

Data Science Studio for Biotech 1: Parsing VCF files

Hi everyone, I'm Eric, a data scientist here at Dataiku. I'm going to be doing a series of blog posts about using Data Science Studio (DSS) in biotech. So stay tuned and check back for more...

Data Preparation, data science, Technology | October 14, 2015 | Eric Kramer

How a Data Science Company Keeps its Data Scientists Happy

Good data scientists are hard to find. Even harder to keep. I know this for a fact since hiring / finding them was my job for four years. But, at Dataiku, they stay. Why? That’s what the following...

Corporate, Technology, business | October 12, 2015 | Lara Khanafer

Ending the R vs Python war in DSS [Video]

On September 09th, Eric Kramer (data scientist at Dataiku) presented our sixth Free Training Webinar, "Ending the R vs Python war in DSS".

data science, Technology, tutorial | September 09, 2015 | Thomas Thus
Page 2 of 4