The Dataiku Blog

Photoshop for Data Science... What Does That Even Mean?

Dataiku has gotten quite a bit of press recently, including an interesting article entitled The Photoshop for Data Scientists. This blog post is an explanation of why we believe this metaphor...

Product, Visual Data Analysis, business | November 04, 2014 | Pauline Brown

Some Feedback on Strata + Hadoop World New York

We were a proud Sponsor of the Strata Hadoop World Conference 2014 in the beautiful city of Manhattan, NYC.

Corporate, Events, Team | October 20, 2014 | Pauline Brown

Working with External Data and APIs in Data Science Studio

The Data Science Studio makes it possible to create an end-to-end analytical workflow using APIs and services from different providers.

Visual Data Analysis, data science, tutorial | October 15, 2014 | Thomas Cabrol

Pronouncing Dataiku

Dataiku isn’t easy to pronounce but it certainly manages to break the ice.

marketing, Corporate, business | August 07, 2014 | Pauline Brown

Dataiku "Shiso" Release and Community Edition

We are very proud to announce today the release of Dataiku Data Science Studio (DSS) V 1.2, codenamed Shiso. With this release, we now offer a free version: Data Science Studio Community Edition !

Product, Corporate, announcement | July 21, 2014 | Clément Stenac

Join Datasets with Approximate String Matching: Movie Titles

When working with text values provided by real users, you must deal with various approximations or typing errors. Let's have a look at how Data Science Studio makes dealing with a list of...

data science, Technology | July 17, 2014 | Jeremy Greze

How do Social Media Publishing Optimization Tools Work ?

Marketers and community managers who work with social networks are often tempted to use social publishing optimization tools. Yet, the recipe behind such tools is often obscure for non-data...

marketing, Technology, business | July 03, 2014 | Florian Douetteau

Easy Text Clustering

Working on text-based datasets is a different world to dealing with numbers. Comparisons between words are much harder and it may be difficult to group or aggregate similar values. Let's find out...

data science, tutorial, clustering | June 24, 2014 | Jeremy Greze

When is ‘Big Data’ Really ‘Big’?

As I’m involved in the startup ecosystem, I meet a lot of startups who work with data. Every week, people ask me the same question: “Is the volume of data I’m dealing with big enough to be called...

Opinion, data science, connected sensors | June 18, 2014 | Florian Douetteau

Dataiku Data Science Studio (DSS) v. 1.1 Brings More Collaboration

At Dataiku we work everyday to make data science more accessible to everyone with our Data Science Studio (DSS). Today, we are proud to announce the release of Data Science Studio V1.1 (Yuzu) , a...

Product, Corporate, announcement | June 04, 2014 | Clément Stenac

Part 2 of My Non Sinking (or How I Ran My First Predictive Model)

Running predictive models is pretty easy in the Data Science Studio. In a few clicks, you have the ability to predict a variable from the data you have. Once you have tried, you will never see...

Opinion, Technology, machine learning | May 20, 2014 | Jeremy

A Marketing Guy's First Steps in Data Science With the Titanic Kaggle

As a marketeer, I had quite a lot of experience using Excel but never really ran predictive models.Find out how I used the studio to predict survival from the sinking of the Titanic.

Data Science Basics, tutorial, Data analysis | May 12, 2014 | Jeremy Greze

Recommendation Engine: There Is No Silver Bullet

As Internet users, every day, we receive many offers for multiple products. Robots send them to us through various communication channels. How do advertisers choose which products to show and...

recommendation, marketing, business | April 24, 2014 | Florian Douetteau

Dataiku at the Hadoop Summit

On April 2-3 took place the Hadoop Summmit Europe, a two-day event about the Apache Hadoop community, in Amsterdam. I gave a talk about “Semi-Supervised learning applied to understanding customer...

Hadoop, Corporate, Events | April 14, 2014 | Florian Douetteau

Coming Back from Big Data Paris

The third edition of Big Data Paris summit took place on April 1st and 2nd 2014. Big Data Paris is the major French event about the big data ecosystem. Many important companies and talented people...

Product, Corporate, Events | April 04, 2014 | Jeremy Greze

Winning Kaggle: An introduction to Re-Ranking

Our regular readers are probably familiar with Kaggle and its machine learning contests. If you're new to Kaggle  you can read this article , "A Kaggle Data Science Competition Made Easy"  to get...

recommendation, data science, machine learning | January 14, 2014 | Paul-Henri Hincelin

Beyond the Hype: the 6 Core Skills of a Good Data Scientist

The term “data scientist” was coined in 2008 by two LinkedIn analysts to describe their work deriving business value from the masses of data being generated by their website. Since then, people...

Opinion, Data Science Basics, business | November 10, 2013 | Florian Douetteau

Machine Learning for Merchandising

The other day, I was talking to my friend who runs a personal e-commerce site selling speciality items. He wanted to understand how machine learning technologies could help him manage his site...

business, machine learning, predictive analytics | October 24, 2013 | Florian Douetteau

Dataiku at Berlin Buzzwords 2013, Part 1

I had the chance to attend and speak at this year's Berlin Buzzwords conference last week, dedicated to the topics of search, scale and storage.

Corporate, Technology, Events | June 13, 2013 | Clément Stenac

Berlin Buzzwords Part 2: Introducing Dataiku Flow and dctc

As I mentioned in my previous post, I also had a chance to talk about what we've been up to over here at Dataiku during my stay in Berlin:

  • Dataiku Flow, the next-generation data pipeline...
Events | June 13, 2013 | Clement
Page 11 of 12