HoracioSoldman/README.md

Hi there 👋

[ A quick Intro.. ]

I’m Horacio Soldman, a Senior Data Engineer & Chevening Alumnus.

I specialise in building scalable, cost-efficient data architectures that bridge the gap between raw data and business intelligence. With a background in Software Engineering and an MSc in Data Science (UK), I focus on optimising pipelines to drive high-ROI decision-making.

🚀 Current Focus: Upskilling with Databricks and preparing for a Data Engineering certification.

📊 Core Expertise: Cloud Migrations (AWS/GCP), Batch and Streaming pipelines, and Data Governance.

💡 Impact: Proven track record of reducing cloud costs by 40% and increasing pipeline speed by 50%.

🌍 Remote Ready: Based in Madagascar (UTC+3), operating with full overlap for UK/EU/US-East teams.

📫 How to reach me: email me at hsoldman@gmail.com or connect via WhatsApp at +44 7512975473.

[ My Tech Stack.. ]

Languages & Frameworks: Python, SQL, Apache Spark

Cloud & Data Platforms: Google Cloud, BigQuery, AWS, Databricks

Orchestration & Infrastructure: Saagie, Apache Airflow, Terraform, Apache Kafka, Docker

Pinned

  1. realtime-events-analytics Public

    An end-to-end data engineering project that enables real-time analytics of website click events.

    Jupyter Notebook 2

  2. batch-processing-on-aws Public

    Built with what I learned from DEZoomcamp by datatalks.club, this project performs batch processing on AWS for the cycling dataset available on the Transport for London (TfL) website. http…

    Jupyter Notebook 15

  3. PSO_ANN Public

    Optimising an Artificial Neural Network with Particle Swarm Optimisation instead of Backpropagation

    Jupyter Notebook 3

  4. export-import-mongodb Public

    This repository contains two scripts to export and import MongoDB collections.

    Shell 7 2