Ramon Perez

Data Scientist & Engineer

hello-ramon@pm.me ramonpzg Sydney, AU

Hi there! Ramon here from the DR 🇩🇴 I was recently working at Seldon in London leading their Developer Relations efforts. I am currently freelancing and working on a few personal projects, mainly, [Inventor](inventor.dev).

I'm an educator at heart on all things science, technology, and (as of recently) design.

I have a lot of breath with different technologies that cover the entire data and machine learning lifecycle, and a lot of depth in technology education.

That said, I'm no native to any kind of technology or skill, but I'm still able to speak a few well with a noticeable accent. My goal when I pick up a new challenge or project is to get it done in a unique and yet efficient way while explaining what I did in clearest way possible.

A bit more detail on my experience.

Although I'm no distributed computing expert, I've work on research engineering projects where I've (managed to) successfully set up compute clusters (🖥+🖥+🖥) using different cloud technologies (☁️⛈☁️) without breaking our AWS bar tab (🍻). The goal of the project was to clean and analyze large amounts of data using frameworks like Dask, PySpark and Ray (not together), and then train --what I would have considered then-- large language models.

I've also done some cool data engineering work. I created a straightforward data platform using only AWS services to move, reshape, clean, and make use of all the data of my employer. Although this project was never continued or picked up by another engineer upon my departure, I learned a ton about data quality, infrastructure and how to (or try to avoid) technical debt.

On the software dev side (👨‍💻), I've built several microservices and web apps, with and without machine learning, to help translate, summarize, and improve educational content. I've written infrastructure as code scripts (with a lot of headaches along the way) to set up a Jupyter Hub instance in GKE and teach different data science classes at different events around Australia.

Experience

Engineering

DX Engineer at Seldon

London

Jun-23 to Oct-24

Actively contributed to Seldon's suite of dev tools with emphasis on the paid machine learning inference products, crafted educational content for the OSS tools, delivered talks at conferences on different ML topics, and engage with our developer community while coordinating the MLOps World Meetup group events, fostering stronger connections and support within our network.

Sr. Product Developer | Instructor at Decoded

Remote

Nov-21 to Present

Created Decoded's initial data engineering platform on AWS, and developed four machine learning (ML) applications for language translation, churn prediction, fraud detection, and content recommendation. I led the creation of our MLOps engineering program (see it here) and contributed to our React-based presentation tool. I still teach for Decoded's biggest clients.

Teaching

London Business School

Programming Instructor

Remote | Jan to Mar-21

Coder Academy

Data Scientist & Educator

Sydney, AU | Sep-19 to Aug-21

Johns Hopkins University

Lead Quantitative Tutor

Remote | Jan to Aug-19

Research

Research Engineer at INSEAD

Remote

Oct-19 to April-21

During my tenure at INSEAD, I worked with a team of outstanding researchers to develop algorithms for analyzing company structures and culture. Our main project focused on generating hierarchical organizational structures (vertical or horizontal) by analyzing historical job advertisements from target companies and similar organizations. We leveraged various natural language processing techniques alongside supervised, semi-supervised, and unsupervised machine learning methods. Additionally, I developed a Python package implementing Latent Dirichlet Allocation to automatically extract and select meaningful topics from company reviews on platforms like Glassdoor. This package not only identified key themes but also provided quantitative cultural measures for organizations based on the discovered topics, offering valuable insights into organizational culture and structure.

Research Associate at Global Financial Literacy Excellence Center

Washington, DC, USA

Sep-18 to April-19

At GFLEC, I led multiple research initiatives focused on retirement and financial education. I developed and deployed a comprehensive MTurk survey examining millennials' financial education seeking behaviors, while simultaneously conducting extensive literature reviews and expert interviews with 20 policymakers and retirement specialists to understand phased retirement programs. Using the Health and Retirement Study dataset, I built statistical models in Stata and R to analyze patterns between future workforce participation and baby boomers' gradual retirement trends, providing valuable insights into retirement planning and financial literacy.

Research Intern at Service Management Group

Kansas City, MO, USA

Oct-16 to Sep-17

Created ArcGIS maps to visualize consumer behavior data, while concurrently conducting sentiment analysis using SPSS and Python, and orchestrating A/B tests to assess the efficacy of various data collection strategies like email, mail, and point-of-sale.

Technical Skills

Languages

Native Speaker: Python, English, JavaScript and Spanish

Intermediate: French and Rust

Technologies

PyTorchDockerFastAPISolidJSNumPypandasAstroGitTauri