The Dataiku Blog

How do Social Media Publishing Optimization Tools Work ?

Marketers and community managers who work with social networks are often tempted to use social publishing optimization tools. Yet, the recipe behind such tools is often obscure for non-data...

marketing, Technology, business | July 03, 2014 | Florian Douetteau

Easy Text Clustering

Working on text-based datasets is a different world to dealing with numbers. Comparisons between words are much harder and it may be difficult to group or aggregate similar values. Let's find out...

data science, tutorial, clustering | June 24, 2014 | Jeremy Greze

When is ‘Big Data’ Really ‘Big’?

As I’m involved in the startup ecosystem, I meet a lot of startups who work with data. Every week, people ask me the same question: “Is the volume of data I’m dealing with big enough to be called...

Opinion, data science, connected sensors | June 18, 2014 | Florian Douetteau

Dataiku Data Science Studio (DSS) v. 1.1 Brings More Collaboration

At Dataiku we work everyday to make data science more accessible to everyone with our Data Science Studio (DSS). Today, we are proud to announce the release of Data Science Studio V1.1 (Yuzu) , a...

Product, Corporate, announcement | June 04, 2014 | Clément Stenac

Part 2 of My Non Sinking (or How I Ran My First Predictive Model)

Running predictive models is pretty easy in the Data Science Studio. In a few clicks, you have the ability to predict a variable from the data you have. Once you have tried, you will never see...

Opinion, Technology, machine learning | May 20, 2014 | Jeremy

A Marketing Guy's First Steps in Data Science With the Titanic Kaggle

As a marketeer, I had quite a lot of experience using Excel but never really ran predictive models.Find out how I used the studio to predict survival from the sinking of the Titanic.

Data Science Basics, tutorial, Data analysis | May 12, 2014 | Jeremy Greze

Recommendation Engine: There Is No Silver Bullet

As Internet users, every day, we receive many offers for multiple products. Robots send them to us through various communication channels. How do advertisers choose which products to show and...

recommendation, marketing, business | April 24, 2014 | Florian Douetteau

Dataiku at the Hadoop Summit

On April 2-3 took place the Hadoop Summmit Europe, a two-day event about the Apache Hadoop community, in Amsterdam. I gave a talk about “Semi-Supervised learning applied to understanding customer...

Hadoop, Corporate, Events | April 14, 2014 | Florian Douetteau

Coming Back from Big Data Paris

The third edition of Big Data Paris summit took place on April 1st and 2nd 2014. Big Data Paris is the major French event about the big data ecosystem. Many important companies and talented people...

Product, Corporate, Events | April 04, 2014 | Jeremy Greze

Winning Kaggle: An introduction to Re-Ranking

Our regular readers are probably familiar with Kaggle and its machine learning contests. If you're new to Kaggle  you can read this article , "A Kaggle Data Science Competition Made Easy"  to get...

recommendation, data science, machine learning | January 14, 2014 | Paul-Henri Hincelin

Beyond the Hype: the 6 Core Skills of a Good Data Scientist

The term “data scientist” was coined in 2008 by two LinkedIn analysts to describe their work deriving business value from the masses of data being generated by their website. Since then, people...

Opinion, Data Science Basics, business | November 10, 2013 | Florian Douetteau

Machine Learning for Merchandising

The other day, I was talking to my friend who runs a personal e-commerce site selling speciality items. He wanted to understand how machine learning technologies could help him manage his site...

business, machine learning, predictive analytics | October 24, 2013 | Florian Douetteau

Dataiku at Berlin Buzzwords 2013, Part 1

I had the chance to attend and speak at this year's Berlin Buzzwords conference last week, dedicated to the topics of search, scale and storage.

Corporate, Technology, Events | June 13, 2013 | Clément Stenac

Berlin Buzzwords Part 2: Introducing Dataiku Flow and dctc

As I mentioned in my previous post, I also had a chance to talk about what we've been up to over here at Dataiku during my stay in Berlin:

  • Dataiku Flow, the next-generation data pipeline...
Events | June 13, 2013 | Clement

The New Search : Fuzzy, Instantaneous, and Local

At Dataiku, we use extensively search logs and associated navigation information for user behaviour analytics and relevance optimization. Most of our customers today use SOLR or ElasticSearch....

data science, Technology | May 03, 2013 | Florian Douetteau

A Complete Guide to Writing Hive UDF

Note that this guide is quite old (it was written when Hive was at version 0.10) and might not apply as-is to recent Hive releases. Use at your own risk :)

Dataiku DSS provides deep integration...

Hadoop, data science, Technology | April 30, 2013 | Clement

Kaggle Contest: Blue Book For Bulldozers

Perhaps you know Kaggle and its slogan “making data science a sport”?

Kaggle is a cool platform for predictive modeling competitions where the best data scientists face each other, all trying to...

data science, machine learning, python | April 25, 2013 | Matt Scordia

Thomas at Strata - Part 2

The previous post on my trip to Strata describes my first day there. You may want to read it here.

The next two days were focused on keynotes and presentations, as well as exhibitors products...

Corporate, Events, strata | March 20, 2013 | Thomas Cabrol

Thomas at Strata - Part 1

I've been lucky enough to travel to Santa Clara, California, and attend the Strata Conf event. I was there for two days and have plenty of insight and feedback on all the sessions over the course...

Corporate, Events | March 12, 2013 | Thomas Cabrol

Visualizing Your LinkedIn Graph Using Gephi - Part 1

Graph analysis becomes a key component of data science. A lot of things can be modeled as graphs, but social networks are really one of the most obvious examples.

In this post, I am going to show how...

Technology | December 17, 2012 | Thomas Cabrol
Page 23 of 24