Introducing PolYamoR, the Two-Way Translator Between Python and R

Dataiku Company Florian Douetteau

Today, data science is the field of a Big War between Python and R.

**Warning! This article was posted as an April Fool's day joke. Our team is still working on making PolYamoR a reality ;) **

As we pointed out in a previous post, “Data Science, Monogamy or Menage à 3 there are ways to make different languages cohabit in data science.

But cohabitation has its own rules, and even if Dataiku DSS can make it smooth, not everyone is ready for it (yet) .

The Long Awaited Solution to Everything  

After one year of intense development, we are proud to open source PolYamoR,  the first forward and reverse-automated R to Python converter and translation system. PolYamoR is the first multi-lingual translation system that enables full transparency, no ambiguity, and manages all of the edge cases of complex programming. PolYamoR can translate plain Python into plain R and vice versa, leading to an unexpected new era of  conversations between cultures. 

The source code is available on GitHub today.

Modern translation systems rely on deep learning in order to achieve their performance. Of course, PolYamoR is no exception.

We trained PolYamoR by providing millions of lines and Python, millions of lines of R, and their respective translation, training a recurrent neural network.

Of course, the very first translation was  crude:

PolyamoR first translation code notebook

AMAZING GPU Clusters at Work 

After thousands of hours of training, involving a 20 nodes clusters with dozen of GPUs, PolYAmoR managed to produce, clean, manageable code (even in R). The code generated by the tool can be very long though:

Dataiku PolYAmoR code R to Python converter

Lost in Translation? 

PolYamoR was originally written in Python, but after a programming error by a team member on Friday night , the program decided to translates itself in R. After a change of mind, PolYamoR is now half Python, half R, and stable enough for production use. 

We are confident that PolYamoR — a full R to Python converter — will change the way data science teams collaborate on a day-to-day basis. Maybe some day a lingua franca will emerge across the layers of the system and will unify the work of all data scientists. But in the meantime, get some fresh R if you want, bite the Python if you like, and keep having fun with data science until next year! 

You May Also Like

Improving Model Accuracy and Predictability With Reinforcement Learning

Read More

Dataiku's Commitment to Measuring and Offsetting Our Environmental Impact

Read More

The Forrester Wave Pegs Dataiku a Leader

Read More

The Human-Centered AI Conference, On Air

Read More