Data Engineering
Building end-to-end pipelines, cloud data synchronization, and structured data extraction at scale
Data Science & Engineering Student @ UC3M
Building + Learning all things data. Always looking to collaborate and learn new things in all technical domains 'If you want to get faster, run with people who are faster than you'
Scroll to exploreI'm a Data Science and Data Engineering student at Universidad Carlos III de Madrid, a Polytechnic University in Madrid, Spain — expected to graduate in May 2027.
Before that, I spent years operating heavy machinery on wheat farms in Odessa, Washington, where I developed a strong work ethic through frequent 14-hour harvest days. That resilience carries into everything I do.
Now I build end-to-end data pipelines, machine learning models, and automation systems. My recent internship at Agops360 had me designing cloud-native pipelines for agricultural data, data architecture, and IoT telemetry.
I'm looking to return to my roots and establish a career in Washington.
Building end-to-end pipelines, cloud data synchronization, and structured data extraction at scale
Statistical modeling, predictive analytics, classification, and unsupervised learning techniques
IoT irrigation systems, farm management APIs, and pesticide regulatory data pipelines
OCR processing, serverless workflows, API integration, and cron-based data enrichment
Created an end-to-end data pipeline for the U.S. EPA's pesticide registration dataset (PPIS). Automated data extraction, cleaning, and synchronization with cloud databases, while employing an LLM agent to extract REI, PHI, and PPE data from PDF pesticide labels in Python.
Developed event-driven data pipelines for an agricultural equipment manufacturer's API to fetch and manage field boundaries, crop types, work plans, and equipment activity. Implemented secure OAuth2 token handling and refresh logic.
Built a data ingestion system connecting an IoT-enabled irrigation network using serverless edge functions, cloud functions, and pub/sub messaging. Captured telemetry data such as pressure, pivot angles, fault states, and communication health.
Designed a fully automated OCR pipeline leveraging cloud-based text extraction services within serverless functions to process PDF irrigation design documents. Parsed and transformed tabular data into CSVs for analytical use.
Led development of machine learning models achieving 89.7% accuracy in predicting e-commerce purchase behavior. Engineered and optimized five classification models (Gradient Boosting, Random Forest, Neural Networks, LDA/QDA) and implemented a risk analysis framework combining PageValues and purchase probabilities.
Coming soon
Writing and notes on data engineering, ML, and whatever I'm learning.
Follow along on my journey, reach out for collaborations, or just say hi.