Andrey Marin

Jorge Machado

Big Data Engineer. Programmer. Consultant.

Master in Networking and Communication Services. Bachelor in Informatics Engineering. Loves distributed systems.

About me

I'm a freelancer working on Big Data, but several things set me apart from others. I was born in Germany, studied Informatics Engineering from 2006 until 2010, and completed my master's in Networking and Communication Services in 2012.

I currently work on Big Data projects using frameworks like Apache Spark, Kafka, Apache Hive, Apache Hadoop and Apache Flume. Each of these technologies is well suited to processing massive amounts of data at large scale. If you hire me, I will help you transition to this new way of processing data.

Before freelancing, I worked for several years as an SAP administrator at great companies like ZF Friedrichshafen AG and s.Oliver, performing SAP upgrades and hardware/OS migrations.

Work Experience.

Here you can see some of my work in brief. If you need more information, download my CV.

SAP Basis Administrator

s.Oliver GmbH and ZF Friedrichshafen AG

Administration of SAP landscapes on AIX, Oracle and DB2.
Python automation scripts, SAP upgrades, system copies, TREX installations and SAP Single Sign-On.

2011 - 2016

Data Engineer

Here Maps, Germany

March 2015 - September 2015

This project consisted of designing and implementing a Big Data architecture on Amazon Web Services using telecommunications data. It included geospatial operations on Spark written in Scala and a REST API to Spark written in Python.

Big Data Trainer

Beuth University of Applied Sciences, Germany

September 2015

Sometimes I also give Big Data training courses lasting two or three days, such as this one at a university in Berlin.

Big Data Engineer | Data Scientist


I help my customers start or maintain their Big Data applications. This ranges from reprogramming applications using Apache Hive, Apache Hadoop or Apache Spark to reviewing their Lambda Architecture.
Furthermore, I help clients who are getting started with machine learning using MLlib and real-time analytics.

2015 - Present



Distributed programming on frameworks like Hadoop or Spark using Python, Java or Scala.



Think different: bring data together for real-time analysis in a cheaper and faster way.


Data Scientist

Getting "Data" is only half of the process. Connecting the dots is the key.


Here you can see some of my top skills. Abilities and certifications like ITIL, AIX, Python or encryption protocols are described in detail in my CV.

SAP Administration and Oracle
Big Data Architecture and Development
Django, JavaScript and Linux

Software Skills

Spark SQL
Apache Spark
Oracle Databases

Consulting on Big Data

Cloudera CDH installation and Sizing
Big Data Architecture
Apache Spark Development
Machine Learning
Hadoop Ecosystem


ITIL Foundation 2012
Databricks Certified Developer
SAP OS & DB Migrations



Master in Networking and Communication Services

Instituto Politécnico do Porto, Porto, Portugal

September 2012

Bachelor in Informatics Engineering

Instituto Politécnico do Porto, Porto, Portugal

October 2010

Born 1988


Here you can see whether I'm busy. I try to keep it up to date.

Contact me.

I live in Würzburg, Germany