Harvest websites using raw HTTP requests, browser automation tools like Selenium, and scraping frameworks like Scrapy, as well as all kinds of web-based APIs (REST, SOAP, …)
Implement batch jobs to retrieve data via common web protocols like HTTP, FTP, …
Extract data from various source formats by implementing extraction heuristics
Work with widely varying document formats, ranging from poorly formed HTML and PDF documents to standard file formats like XML, JSON, and CSV, using the standard tools (like XPath) to process them
Store extracted data in SQL databases (mainly PostgreSQL) in a generic format
Integrate your code into the data pipeline that drives our whole data processing infrastructure.
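As a minimal sketch of the extract-and-store workflow described above (the feed contents, source label, and record schema here are hypothetical, and the PostgreSQL step is only noted in a comment to keep the example self-contained):

```python
import xml.etree.ElementTree as ET

def extract_records(xml_text):
    """Pull (key, value) rows out of a hypothetical XML feed using
    XPath-style paths (ElementTree supports a limited XPath subset)."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.findall(".//item"):
        rows.append({
            "source": "example-feed",       # hypothetical source label
            "key": item.findtext("name"),
            "value": item.findtext("price"),
        })
    return rows

SAMPLE = """<catalog>
  <item><name>widget</name><price>9.99</price></item>
  <item><name>gadget</name><price>4.50</price></item>
</catalog>"""

records = extract_records(SAMPLE)
# Storing the rows in PostgreSQL (e.g. via psycopg2) could then be a
# simple INSERT into a generic (source, key, value) table; for messy
# real-world HTML, a tolerant parser such as lxml.html would replace
# ElementTree here.
```

The generic key/value shape is one common way to keep the database schema stable while source formats vary.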
We are looking for:
Relevant software development experience (preferably in Python)
Basic working experience with Linux and willingness to deepen it
Ability to work on intricate details without losing the big picture.
Experience with Amazon Web Services or eagerness to learn it
Nice to have: experience with application containers (preferably Docker)
Experience with distributed version control systems (Git)
Understanding of Agile methodologies
Must be a self-learner, possessing inherent inquisitiveness
Good problem-solving and analytical skills
Strong interpersonal, communication, and organizational skills
Minimum of a Bachelor's degree in Computer Science or a related field, or equivalent
What We Offer
Be part of an international team distributed all over the globe
A relaxed work environment that values innovation, initiative, and energy
On a rainy day you can choose to work remotely, and most communication happens via video calls using Google Hangouts