Joel Pinto da Mata - CV

Contact Information

joelpintomata.com | LinkedIn | joelmatacv@runbox.com

Work Experience

Software Engineer at KPN - DSH - Real-Time Exchange Platform, December 2022 - Present

Developing DSH/KPN DataCatalogue, a DataMesh/DataMarket platform:

  • Implementation of Python-based ETL pipelines for data ingestion into DataHub.
  • Pipeline orchestration with Airflow, incorporating custom instrumentation for traceability, recoverability and blacklisting.
  • Led an open-source contribution for DataHub Azure Blob ingestion.

Development of a Confluent-compliant, multi-tenant Schema Registry using Java/Spring Boot.

Tech Stack: Java, Spring, Python, Kafka, Airflow, pySpark, DataHub, Apicurio, Kubernetes, Flux

Senior Data Engineer at DPG Media Nederland, April 2022 – December 2022

Designed and implemented a real-time delivery disruption detection and notification system using a serverless architecture with event-driven data pipelines:

  • Completed all project lifecycle phases with AWS tooling, from infrastructure provisioning using Terraform to CI/CD pipelines, code development, and testing.
  • Conducted stakeholder management, requirements analysis, and technical analysis.

Tech Stack: AWS (Lambda, DynamoDB, API Gateway, Code Build, Code Deploy, SQS, SNS), Node.js, Python, Terraform

Senior Data Engineer at Scival, April 2021–April 2022

Developed a large-scale analytics platform for research data insights:

  • Created data processing and enrichment pipelines with Java, Python, Scala, and Spark.
  • Conducted data analysis and modeling.

Tech Stack: Java, Scala, Python, Spark, Kafka, Hive, AWS (EMR, S3, Redshift, Notebooks)

Delegate Architect/Senior Software Engineer at Mendeley, March 2020–April 2021

Developed a fully customizable solution for research data management, significantly increasing market potential and customer fit while reducing onboarding time.

Led cross-team technical initiatives, established architectural principles, and maintained technical artifacts.

Tech Stack: Java, Spring Boot, MariaDB, PostgreSQL, AWS, SQS, ActiveMQ, Jenkins, Kubernetes

Delegate Architect/Senior Software Engineer at Research Data Search, September 2018–March 2020

Developed Elsevier’s Research Data Search engine, which currently powers http://data.mendeley.com:

  • Implemented data pipelines for metadata extraction, data classification, and enrichment.
  • Implemented back-end services for overall solution support.

Tech Stack: Java, Python, Spring Boot, Solr, Spark, Apache NiFi, AWS, EMR, GitLab, Kubernetes, Docker

Senior Big Data Engineer at PublicSonar, March 2018 – September 2018

Developed a real-time social media analysis platform for early warning and incident management:

  • Improved project scalability, availability, and the general development process.
  • Migrated from stateful to stateless architectures; benchmarked and tuned microservices.
  • Introduced coding standards, best practices, and CI/CD processes, and increased test coverage.
  • Built data processing and analysis pipelines.

Tech Stack: Golang, Spark, Kafka, RabbitMQ, Mongo, GitLab, Docker

Senior Backend Developer at iFlavours, October 2015 – February 2018

  • Developed a greenfield e-commerce web shop and price comparator.

Tech Stack: Java, Scala, Spark, Spring Boot, Play Framework, Mongo, MariaDB, AWS, GitLab, Terraform, Docker

IT Engineer at European Space Agency (ESTEC), February 2011 – September 2015

Various Development Roles between 2007 and 2011

Education

  • Master's in Computer Engineering - Universidade Nova de Lisboa (FCT), Portugal

Certifications

  • The Open Group Certified: TOGAF®
  • Oracle Certified Master Java SE 6 Developer
  • Oracle Certified Professional Java SE 5 Programmer
  • Spring Core V3
  • ITIL V3 Foundations
  • Green Belt Foundational Security Training

Projects

  • Project Thrive: Mentoring junior developers through the OfferZen Foundation.

Skills

  • Languages: (+)Java, Python, (-)Scala, (-)Node.js, (-)Golang
  • Data Processing: Kafka, Spark, Airflow, Apache NiFi
  • Cloud Services: AWS (Lambda, API Gateway, EMR, S3, Redshift, SQS, SNS)
  • DevOps: Terraform, Docker, GitLab, Kubernetes