Joel Pinto da Mata - CV
joelpintomata.com | LinkedIn | joelmatacv@runbox.com
Work Experience
Developing DSH/KPN DataCatalogue, a DataMesh/DataMarket platform:
- Implementation of Python-based ETL pipelines for data ingestion into DataHub.
- Pipeline orchestration with Airflow, incorporating custom instrumentation for traceability, recoverability and blacklisting.
- Led an open-source contribution for DataHub Azure Blob ingestion.
Development of a Confluent-compliant, multi-tenant Schema Registry using Java/Spring Boot.
Tech Stack: Java, Spring, Python, Kafka, Airflow, PySpark, DataHub, Apicurio, Kubernetes, Flux
Designed and implemented a real-time delivery disruption detection and notification system using a serverless architecture with event-driven data pipelines:
- Completed all project lifecycle phases with AWS tooling, from infrastructure provisioning with Terraform to CI/CD pipelines, code development, and testing.
- Conducted stakeholder management, requirements analysis, and technical analysis.
Tech Stack: AWS (Lambda, DynamoDB, API Gateway, CodeBuild, CodeDeploy, SQS, SNS), Node.js, Python, Terraform
Developed a large-scale analytics platform for research data insights:
- Created data processing and enrichment pipelines with Java, Python, Scala, and Spark.
- Conducted data analysis and modeling.
Tech Stack: Java, Scala, Python, Spark, Kafka, Hive, AWS (EMR, S3, Redshift, Notebooks)
Developed a fully customizable solution for research data management, significantly increasing market potential and customer fit while reducing onboarding time.
Led cross-team technical initiatives, established architectural principles, and maintained technical artifacts.
Tech Stack: Java, Spring Boot, MariaDB, PostgreSQL, AWS, SQS, ActiveMQ, Jenkins, Kubernetes
Delegate Architect/Senior Software Engineer at Research Data Search, September 2018 – March 2020
Developed Elsevier’s Research Data Search engine, which currently powers http://data.mendeley.com:
- Implemented data pipelines for metadata extraction, data classification, and enrichment.
- Implemented back-end services for overall solution support.
Tech Stack: Java, Python, Spring Boot, Solr, Spark, Apache NiFi, AWS, EMR, GitLab, Kubernetes, Docker
Senior Big Data Engineer at PublicSonar, March 2018 – September 2018
Developed a social media real-time analysis platform for early warning and incident management.
- Improved project scalability, availability, and the general development process:
- Migrated from stateful to stateless architectures; benchmarked and tuned microservices.
- Introduced coding standards, best practices, and CI/CD processes, and increased test coverage.
- Built data processing and analysis pipelines.
Tech Stack: Golang, Spark, Kafka, RabbitMQ, Mongo, GitLab, Docker
Senior Backend Developer at iFlavours, October 2015 – February 2018
- Developed a greenfield e-commerce web shop and price comparator.
Tech Stack: Java, Scala, Spark, Spring Boot, Play Framework, Mongo, MariaDB, AWS, GitLab, Terraform, Docker
IT Engineer at European Space Agency (ESTEC), February 2011 – September 2015
Various Development Roles between 2007 and 2011
Education
- Master's in Computer Engineering - Universidade Nova de Lisboa (FCT), Portugal
Certifications
- The Open Group Certified: TOGAF®
- Oracle Certified Master Java SE 6 Developer
- Oracle Certified Professional Java SE 5 Programmer
- Spring Core V3
- ITIL V3 Foundations
- Green Belt Foundational Security Training
Projects
- Project Thrive: Mentoring junior developers through the OfferZen Foundation.
Skills
- Languages: (+)Java, Python, (-)Scala, (-)Node.js, (-)Golang
- Data Processing: Kafka, Spark, Airflow, Apache NiFi
- Cloud Services: AWS (Lambda, API Gateway, EMR, S3, Redshift, SQS, SNS)
- DevOps: Terraform, Docker, GitLab, Kubernetes