Analyzing the Irish Job Market with Dataiku and import.io

Use Cases & Projects Thomas Thus

Getting value from your data is not a straightforward process. One of the secrets is to have a platform in place that allows you to quickly prototype data applications. In this two-part series we are going to analyze the job market in Ireland. We will use the popular import.io service to ingest a relevant dataset and then use the Dataiku DSS import.io plugin to ingest the data and use Dataiku DSS functionality to generate some insights.

This blog post was originally written and published by sonra.io. You can find the original blog post by clicking here.

About sonra.io

 

Sonra logo

 

sonra.io is the leader for scalable big data solutions in Ireland. They help their clients to gain a strategic advantage over their competition by turning their data into information, gaining actionable insights from information, and turning the insights into a profit impact.

About import.io

 

Import

 

import.io is an online platform to easily extract data from websites. Using the magic extractor is as easy as 1,2,3 to generate a virtual API on top of the required data set and then iterate over it.

The data set we are interested in can be found here. We pass this URL into the import.io Magic API, run the query and get the data back in a nice table.

Magic API table

Next we generate the virtual API

virtual API generation

Querying the API will return a JSON object. In a next step we now use the import.io plugin in Dataiku DSS to ingest the data and paste the URL to the virtual API into the API URL field

virtual API into the API

In order to page/iterate over the API we need to create a Dataiku DSS recipe that passes the the irish jobs.ie source URLs into the virtual API. A recipe always requires an input dataset. We use an Oracle database as the source to generate this dataset using Connect By clause and a little trick.

dataiku dss recipe

In a next step, we run the recipe and explore the output dataset. Using the Columns Quick View we can get a nice overview on the distribution of the various columns of our dataset.

distinct values in dataiku dss

We can explore the data in more detail, e.g. which agency has the most jobs advertised.

exploring the data with dataiku dss

Or the range of salaries

distinct values in dataiku dss

Or the location

location values in dataiku dss

 It took me longer to write up this post than it took to ingest and explore the data. In the next part we will have a look at how easy it is to transform our data and gain some more insights on what skills are most in demand at the moment and which skills correlate with each other. Until then don’t stop to unleash the value of your data.

To try Dataiku DSS's Free Edition now, just click here. 

 

You May Also Like

Taming LLM Outputs: Your Guide to Structured Text Generation

Read More

No-Code ML and GenAI With Dataiku and Fabric

Read More

The Objects of an LLM Mesh for Building LLM-Powered Applications

Read More

Data Lineage: The Key to Impact and Root Cause Analysis

Read More