Dataiku and Spark, a Powerful Combination

Dataiku Product Robert Kelley

As far back as Dataiku 4.0, we've included features that help data scientists make the most out of Spark using Dataiku. Dataiku and Spark combine to help users get the most out of data science. Faster computations, PySpark, Spark Scala, Spark R, and more, plus easy upgrades.

Heads Up!

This blog post is about an older version of Dataiku. See the release notes for the latest version.

Let's go >

 

Check out the video below for  ways that Dataiku makes the most out of Spark. Specifically, the video covers:

  • Dataiku's visual machine learning and how it works with Spark, along with the coding languages available (Spark Scala, PySpark, Spark R, and Spark SQL)
  • How to create multiple generic profiles on Spark via Dataiku, which allows more people in your organization to benefit from Spark
  • Spark pipelines, which are a new feature in Dataiku 4.0, and that enable much faster calculations in running Spark workflows
  • For those of you upgrading to Spark 2.x, Dataiku makes it very simple to keep all your data preparation and models just as you had them before

 

You May Also Like

From Vision to Value: Visual GenAI in Dataiku

Read More

Data Preparation Dataiku Hidden Gems: Part 2

Read More

Maximizing Enterprise Data Products Distribution

Read More

AI Isn't Taking Over, It's Augmenting Decision-Making

Read More