Dataiku and Spark, a Powerful Combination

Dataiku Product Robert Kelley

As far back as Dataiku 4.0, we've included features that help data scientists make the most of Spark from within Dataiku: faster computations, support for PySpark, Spark Scala, Spark R, and more, plus easy upgrades.

Heads Up!

This blog post is about an older version of Dataiku. See the release notes for the latest version.


Check out the video below for ways that Dataiku makes the most of Spark. Specifically, the video covers:

  • Dataiku's visual machine learning and how it works with Spark, along with the coding languages available (Spark Scala, PySpark, Spark R, and Spark SQL)
  • How to create multiple generic profiles on Spark via Dataiku, which allows more people in your organization to benefit from Spark
  • Spark pipelines, a new feature in Dataiku 4.0 that enables much faster computation when running Spark workflows
  • How Dataiku makes upgrading to Spark 2.x simple, keeping all your data preparation and models just as you had them before

 
