The Dataiku Blog

Robert Kelley

Recent Posts

How to Figure Out if Your Model Sucks

We’ve spoken before about basic concepts in data science and machine learning algorithms, and so by this point, you might have already created a machine learning model. But now the question is:...

Data Science Basics, machine learning, Infographic | June 15, 2017 | Robert Kelley

Hadoop Security Basics (In Under 5 Minutes)

With data and analytics teams growing larger and working on more collaborative projects, security and governance are becoming bigger issues. 

Hadoop, Technology, Data Engineering | May 25, 2017 | Robert Kelley

Doing Data Science with Dataiku and Vertica

Many Dataiku customers use Vertica, the powerful SQL database management software, as the engine on which to run Dataiku. This makes sense, because not only is it easy to get started with Dataiku...

data science, SQL, Vertica | May 10, 2017 | Robert Kelley

You Need Open Source Technologies, but They're Not Easy

The open source software movement didn’t start with data science solutions, but it sure has carried forward many of the biggest advancements in data science in the past decade or so. Starting with...

Hadoop, Product, Technology | May 05, 2017 | Robert Kelley

Why You Should Use Kafka Even Though You Might Not Need It (Yet)

You’ve probably heard quite a bit about Apache Kafka, the leading system for storing streaming data that LinkedIn open-sourced back in 2011. Apache Kafka is an extremely powerful and scalable...

data science, predictive analytics, architecture | April 18, 2017 | Robert Kelley

Video: Collaboration with Dataiku

"Collaboration" is quite the buzzword these days, but at Dataiku, it's at the very foundation of our product. Projects in Dataiku are designed to support multiple users by default.  

Product, business, collaboration | March 27, 2017 | Robert Kelley

An Introduction to Key Data Science Concepts

Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they...

Data Science Basics, machine learning, Infographic | March 09, 2017 | Robert Kelley

Upcoming Webinar: How Dataiku works with Microsoft HDInsight

Dataiku’s integration with HDInsight, the fully-managed Hadoop platform from Microsoft, is a great example of the openness with which Dataiku was built. This versatility means that it is easy to...

Partnership, webinar, business | February 28, 2017 | Robert Kelley

Dataiku 4.0 Is Out Now: True Scalable Collaboration!

Today we are announcing the release of Dataiku DSS 4.0, which introduces new functionalities that improve the production, development, and management of large-scale data science projects. 

Product, Corporate, announcement | February 23, 2017 | Robert Kelley

Video: Dataiku and Spark, a Powerful Combination

Today is the first day of the Spark Summit East 2017 in Boston, and just in time, we have a brand new video showing why Dataiku and Spark are such a powerful combination.

spark | February 07, 2017 | Robert Kelley

Machine Learning Explained: Algorithms Are Your Friend

We hear the term “machine learning” a lot these days, usually in the context of predictive analysis and artificial intelligence. Machine learning is, more or less, a way for computers to learn...

Data Science Basics, machine learning, Data analysis | January 19, 2017 | Robert Kelley

Working with Dates in Excel Is Frustrating

If you’ve spent much time working with Excel, you know that things can get really frustrating when you are (or Excel thinks you are) working with dates. In its efforts to be helpful, it often...

big data, data science, Technology, data | December 22, 2016 | Robert Kelley

Become a Tableau Power User with Dataiku DSS

Our goal here at Dataiku is to help people everywhere grow their expertise and confidence with data. A vital part of this is being able to integrate easily with Tableau, making you a Tableau Power...

Data Visualization, big data, data science | November 06, 2016 | Robert Kelley

Dataiku and Microsoft: A Complete Data Analytics Platform

One of the advantages of the Microsoft ecosystem is just how comprehensive it is. From Azure to HDInsight to Power BI, there are many ways Microsoft can enhance your data analytics experience. And...

Data Preparation, tutorial, Data analysis | October 20, 2016 | Robert Kelley

Building a Data Pipeline to Clean Dirty Data

A data pipeline is a series of steps that your data moves through. The output of one step in the process becomes the input of the next. Data, typically raw data, goes in one side, goes through a...

Data Preparation, Technology, Data analysis | February 10, 2016 | Robert Kelley