Dataiku and Spark, a Powerful Combination

Dataiku Product Robert Kelley

As far back as Dataiku 4.0, we've included features that help data scientists make the most out of Spark using Dataiku. Dataiku and Spark combine to help users get the most out of data science. Faster computations, PySpark, Spark Scala, Spark R, and more, plus easy upgrades.

Heads Up!

This blog post is about an older version of Dataiku. See the release notes for the latest version.

Let's go >

 

Check out the video below for  ways that Dataiku makes the most out of Spark. Specifically, the video covers:

  • Dataiku's visual machine learning and how it works with Spark, along with the coding languages available (Spark Scala, PySpark, Spark R, and Spark SQL)
  • How to create multiple generic profiles on Spark via Dataiku, which allows more people in your organization to benefit from Spark
  • Spark pipelines, which are a new feature in Dataiku 4.0, and that enable much faster calculations in running Spark workflows
  • For those of you upgrading to Spark 2.x, Dataiku makes it very simple to keep all your data preparation and models just as you had them before

 

You May Also Like

The Foundation for Agent Success: The Modern AI Architecture

Read More

From Chaos to Control: End Shadow Analytics With Dataiku

Read More

8 Ways Dataiku Helps IT Teams Overcome Infrastructure Barriers

Read More

Introducing: The Dataiku Code Agentifier™

Read More