Ramon Perez
Data Scientist & Engineer
Hi there! Ramon here from the DR 🇩🇴 I was recently working at Seldon in London leading their Developer Relations efforts. I am currently freelancing and working on a few personal projects, mainly, [Inventor](inventor.dev).
I'm an educator at heart on all things science, technology, and (as of recently) design.
I have a lot of breath with different technologies that cover the entire data and machine learning lifecycle, and a lot of depth in technology education.
That said, I'm no native to any kind of technology or skill, but I'm still able to speak a few well with a noticeable accent. My goal when I pick up a new challenge or project is to get it done in a unique and yet efficient way while explaining what I did in clearest way possible.
A bit more detail on my experience.
Although I'm no distributed computing expert, I've work on research engineering projects where I've (managed to) successfully set up compute clusters (🖥+🖥+🖥) using different cloud technologies (☁️⛈☁️) without breaking our AWS bar tab (🍻). The goal of the project was to clean and analyze large amounts of data using frameworks like Dask, PySpark and Ray (not together), and then train --what I would have considered then-- large language models.
I've also done some cool data engineering work. I created a straightforward data platform using only AWS services to move, reshape, clean, and make use of all the data of my employer. Although this project was never continued or picked up by another engineer upon my departure, I learned a ton about data quality, infrastructure and how to (or try to avoid) technical debt.
On the software dev side (👨💻), I've built several microservices and web apps, with and without machine learning, to help translate, summarize, and improve educational content. I've written infrastructure as code scripts (with a lot of headaches along the way) to set up a Jupyter Hub instance in GKE and teach different data science classes at different events around Australia.
Experience
Engineering
DX Engineer at Seldon
London
Actively contributed to Seldon's suite of dev tools with emphasis on the paid machine learning inference products, crafted educational content for the OSS tools, delivered talks at conferences on different ML topics, and engage with our developer community while coordinating the MLOps World Meetup group events, fostering stronger connections and support within our network.
Sr. Product Developer | Instructor at Decoded
Remote
Created Decoded's initial data engineering platform on AWS, and developed four machine learning (ML) applications for language translation, churn prediction, fraud detection, and content recommendation. I led the creation of our MLOps engineering program (see it here) and contributed to our React-based presentation tool. I still teach for Decoded's biggest clients.
Teaching
London Business School
Programming Instructor
Remote | Jan to Mar-21
I worked at LBS as an external instructor teaching a series of 6 sessions in statistical programming with Python for a course titled, People Analytics. Here is an overview of the topics I taught in all 6 sessions,
- Intro to Python
- Regression Pt 1
- Regression Pt 2
- Panel Regression and Fixed Effects
- Regularization
- Intro to NLP and Topic Modeling
- Network Analysis
Coder Academy
Data Scientist & Educator
Sydney, AU | Sep-19 to Aug-21
At Coder Academy, I taught courses ranging from introduction to research to data analysis with Python, and with every course, I have been a student of my students as I always learn more from them than they do from me. My main responsibility though was to develop the first accredited Data Science Bootcamp outside of a university in Australia. Said bootcamp application for accreditation was submitted to TEQSA n July 2021. Here are the subjects that I worked on for the past 2 years.
- Programming for Data Analytics
- Advanced Statistics
- Data Engineering
- Machine Learning
- Cloud Computing, Data Ethics and Privacy
- Software Engineering for Data Professionals (ML in Production)
Johns Hopkins University
Lead Quantitative Tutor
Remote | Jan to Aug-19
As the Lead Quantitative Tutor, I guided students through R and Python programming while teaching advanced courses in experimental design, big data analytics, and machine learning. I developed comprehensive introductory programming courses tailored for graduate students of varying technical backgrounds, and created a structured onboarding manual to streamline the hiring and training process for new tutors joining our team.
Research
Research Engineer at INSEAD
Remote
During my tenure at INSEAD, I worked with a team of outstanding researchers to develop algorithms for analyzing company structures and culture. Our main project focused on generating hierarchical organizational structures (vertical or horizontal) by analyzing historical job advertisements from target companies and similar organizations. We leveraged various natural language processing techniques alongside supervised, semi-supervised, and unsupervised machine learning methods. Additionally, I developed a Python package implementing Latent Dirichlet Allocation to automatically extract and select meaningful topics from company reviews on platforms like Glassdoor. This package not only identified key themes but also provided quantitative cultural measures for organizations based on the discovered topics, offering valuable insights into organizational culture and structure.
Research Associate at Global Financial Literacy Excellence Center
Washington, DC, USA
At GFLEC, I led multiple research initiatives focused on retirement and financial education. I developed and deployed a comprehensive MTurk survey examining millennials' financial education seeking behaviors, while simultaneously conducting extensive literature reviews and expert interviews with 20 policymakers and retirement specialists to understand phased retirement programs. Using the Health and Retirement Study dataset, I built statistical models in Stata and R to analyze patterns between future workforce participation and baby boomers' gradual retirement trends, providing valuable insights into retirement planning and financial literacy.
Research Intern at Service Management Group
Kansas City, MO, USA
Created ArcGIS maps to visualize consumer behavior data, while concurrently conducting sentiment analysis using SPSS and Python, and orchestrating A/B tests to assess the efficacy of various data collection strategies like email, mail, and point-of-sale.
Technical Skills
Languages
Native Speaker: Python, English, JavaScript and Spanish
Intermediate: French and Rust