At Dataiku, we work every day to make data science more accessible to everyone with Dataiku DSS. Today, we are proud to announce the release of Dataiku DSS V1.1 (Yuzu), a major step forward for all data analysis projects.
Heads Up!
This blog post is about an older version of Dataiku. See the release notes for the latest version.
This update contains some of the most frequently requested features from our users, as well as some that have long been in the pipe:
Projects and Collaboration
Once you start using Dataiku, you won't want to stop after your first data project. Dataiku now features a brand new project management system. Each project is a separate working environment, containing its datasets, transformation recipes, and predictive models. Projects let you organize your workflow and handle multiple levels of access rights (readers, analysts, and administrators).
You will also discover new collaboration tools — tags, timelines, comments, and notifications — that let you work efficiently with your team. You can now share comments, charts, datasets, notebooks, and custom HTML visualizations with your colleagues and pin them on a board accessible to anyone.
New Machine Learning Guided Interface
Our business users like the way they can create predictive models within the graphical interface without coding. For them, we refreshed the interface to do so. It supports a vast choice of algorithms and parameters. You now have the ability to perform many runs and easily compare them.
Text Mining Tools
Text mining and analysis require special tools to be effective. We included new processors to our interface to help you transform and enrich your text-based datasets in a few clicks:
-
Text simplifications to merge words or queries that should be considered the same
-
Extractions of sequences of words, called ngrams, from text to a new column
-
Fuzzy left join on your data to match lines from different datasets even if the two strings being matched are not exactly equal, but close
Enhanced Connectivity
We keep adding new connectors to our platform to let you work on top of all the major databases or social services. In Dataiku DSS V1.1, we are happy to announce support for Impala, MongoDB, and Twitter.
The Twitter connectivity lets you capture keywords or hashtags and save the tweets directly within Dataiku.
With the MongoDB connector, you will benefit from the simplicity and the power of use of this NoSQL database to manage your documents and perform analyses.
Other Improvements and Bug Fixes
The user experience has been generally improved in this new version. We want to provide the best usability even when you deal with thousands of databases and millions of records. Our data scientists work every day with our solution, so we are sure to have good feedback.