The Dataiku Blog

Robert Kelley

Recent Posts

Watch a Clustering Algorithm Work in the Wild

Previously, we’ve discussed regression and classification algorithms, and now that you're an expert in these types of supervised learning, it's time to delve into unsupervised learning....

Data Science Basics, machine learning, clustering | August 16, 2017 | Robert Kelley

The 4 Trends That Are Upending the Analyst World

You don’t need us to tell you that the data world – and everything it touches, which is, like, everything – is changing rapidly. These trends are driving the opportunities that will fuel your...

Opinion, Data analysis, Team | August 14, 2017 | Robert Kelley

Deployment to Production is Organizational (not Just Technical)

We recently shared how we at Dataiku approach and facilitate data analytics in production, and we have one more angle to take on it: the human angle.

Production, data science, architecture | August 07, 2017 | Robert Kelley

How to Become the Analyst of the Future: Choose Your Own Adventure

For business and data analysts, tools and goals have been shifting rapidly for a few years now. An explosion in data means keeping up with a steep learning curve, and increased competition means...

Data Science Basics, Data Preparation, Data analysis | August 01, 2017 | Robert Kelley

Production is Where Your Analytics is Really the Star

Imagine if you and your team spent months putting together a powerful analytics or data science model with great insights, but nobody ever used the dashboards or automated decisions with it. It...

Production, data science, IT | July 24, 2017 | Robert Kelley

Get to Know NYC and Paris from the Point of View of an Algorithm

When you walk around a city, you get a pretty good idea of how neighborhoods compare. You might start in a residential neighborhood and then walk to a busy area with lots of bars and restaurants...

Data Visualization, Product, clustering | July 17, 2017 | Robert Kelley

Want a Robust Model? Try These Validation Strategies [Infographic]

When evaluating machine learning models, the validation step helps you find the best parameters for your model while also preventing it from becoming overfitted. Two of the most popular strategies...

Data Science Basics, machine learning, Data analysis | July 12, 2017 | Robert Kelley

Dataiku 4.0.5 Is the Between-Album Single You've Been Waiting For

In the rap world, it's common for an artist to drop a mix tape in between proper albums as a sort of appetizer for the main course. And some of these mix tapes wind up being really, really good....

Product, Corporate, release | June 30, 2017 | Robert Kelley

How to Figure Out if Your Model Sucks

We’ve spoken before about basic concepts in data science and machine learning algorithms, and so by this point, you might have already created a machine learning model. But now the question is:...

Data Science Basics, machine learning, Infographic | June 15, 2017 | Robert Kelley

Hadoop Security Basics (In Under 5 Minutes)

With data and analytics teams growing larger and working on more collaborative projects, security and governance are becoming bigger issues.

Hadoop, Technology, Data Engineering | May 25, 2017 | Robert Kelley

Doing Data Science with Dataiku and Vertica

Many Dataiku customers use Vertica, the powerful SQL database management software, as the engine on which to run Dataiku. This makes sense, because not only is it easy to get started with Dataiku...

data science, SQL, Vertica | May 10, 2017 | Robert Kelley

You Need Open Source Technologies, but They're Not Easy

The open source software movement didn’t start with data science solutions, but it sure has carried forward many of the biggest advancements in data science in the past decade or so. Starting with...

Hadoop, Product, Technology | May 05, 2017 | Robert Kelley

Why You Should Use Kafka Even Though You Might Not Need It (Yet)

You’ve probably heard quite a bit about Apache Kafka, the leading system for storing streaming data that LinkedIn open sourced back in 2011. Apache Kafka is an extremely powerful and scalable...

data science, predictive analytics, architecture | April 18, 2017 | Robert Kelley

Video: Collaboration with Dataiku

"Collaboration" is quite the buzzword these days, but at Dataiku, it's at the very foundation of our product. Projects in Dataiku are designed to support multiple users by default.  

Product, business, collaboration | March 27, 2017 | Robert Kelley

An Introduction to Key Data Science Concepts

Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they...

Data Science Basics, machine learning, Infographic | March 09, 2017 | Robert Kelley

Upcoming Webinar: How Dataiku works with Microsoft HDInsight

Dataiku’s integration with HDInsight, the fully-managed Hadoop platform from Microsoft, is a great example of the openness with which Dataiku was built. This versatility means that it is easy to...

Partnership, webinar, business | February 28, 2017 | Robert Kelley

Dataiku 4.0 Is Out Now: True Scalable Collaboration!

Today we are announcing the release of Dataiku DSS 4.0, which introduces new functionalities that improve the production, development, and management of large-scale data science projects.

Product, Corporate, announcement | February 23, 2017 | Robert Kelley

Video: Dataiku and Spark, a Powerful Combination

Today is the first day of the Spark Summit East 2017 in Boston, and just in time, we have a brand new video showing why Dataiku and Spark are such a powerful combination.

spark | February 07, 2017 | Robert Kelley

Machine Learning Explained: Algorithms Are Your Friend

We hear the term “machine learning” a lot these days, usually in the context of predictive analysis and artificial intelligence. Machine learning is, more or less, a way for computers to learn...

Data Science Basics, machine learning, Data analysis | January 19, 2017 | Robert Kelley

Working with Dates in Excel Is Frustrating

If you’ve spent much time working with Excel, you know that things can get really frustrating when you are (or Excel thinks you are) working with dates. In its efforts to be helpful, it often...

big data, data science, Technology, data | December 22, 2016 | Robert Kelley