Dataiku and Spark, a Powerful Combination

Dataiku Product Robert Kelley

As far back as Dataiku 4.0, we've included features that help data scientists make the most out of Spark using Dataiku. Dataiku and Spark combine to help users get the most out of data science. Faster computations, PySpark, Spark Scala, Spark R, and more, plus easy upgrades.

Heads Up!

This blog post is about an older version of Dataiku. See the release notes for the latest version.

Let's go >

 

Check out the video below for  ways that Dataiku makes the most out of Spark. Specifically, the video covers:

  • Dataiku's visual machine learning and how it works with Spark, along with the coding languages available (Spark Scala, PySpark, Spark R, and Spark SQL)
  • How to create multiple generic profiles on Spark via Dataiku, which allows more people in your organization to benefit from Spark
  • Spark pipelines, which are a new feature in Dataiku 4.0, and that enable much faster calculations in running Spark workflows
  • For those of you upgrading to Spark 2.x, Dataiku makes it very simple to keep all your data preparation and models just as you had them before

 

You May Also Like

Alteryx to Dataiku: Best of 2024

Read More

Frende Forsikring: Simplifying Claims Reporting for Customers

Read More

From LLM Mess to LLM Mesh: Building Scalable AI Applications

Read More

5 Challenges to Modern Data Insights

Read More