Ramon Perez
Data Scientist & Engineer
Hi there! I'm an engineer and educator specializing in the design and deployment of machine learning systems. I have over 7 years of experience, and have worked on everything from distributed data processing with Dask and PySpark to building and scaling LLM-powered applications from clusters to single devices that run inside robots. My engineering philosophy is ship it, sweat the details and teach it.
At my core, I'm an educator. I've had the privilege of teaching data science and AI at institutions like Johns Hopkins University, London Business School, and Coder Academy, and for organisations like Mastercard, Capital One, Chanel, Bupa, HSBC and more. This experience has shaped how I build products, making me focus on understanding where users get stuck and creating tools and documentation that remove friction. Currently, I teach MLOps at Decoded and co-run a technology consultancy based in the Dominican Republic, helping businesses navigate the complexities of AI adoption.
Experience
Engineering
Research Engineer at Menlo Research
Remote
Built performance benchmarking and visualization tools for local LLM deployment in robotics contexts. Created robobench (Python/C++ library for measuring real-world LLM performance) and roboviz (real-time visualization of token flow through CPU architecture using Python/Vue.js). Presented research on C++ LLM integration at CppNow 2025. Rebuilt Jan.ai's documentation architecture, created MCP integration tutorials, and conducted user research to shape product strategy. Currently working on simulation platform and data engine infrastructure for the Asimov robotics team.
DX Engineer at Seldon
London
Actively contributed to Seldon's suite of dev tools with emphasis on the paid machine learning inference products, crafted educational content for the OSS tools, delivered talks at conferences on different ML topics, and engage with our developer community while coordinating the MLOps World Meetup group events, fostering stronger connections and support within our network.
Sr. Product Developer | Instructor at Decoded
Remote
Created Decoded's initial data engineering platform on AWS, and developed four machine learning (ML) applications for language translation, churn prediction, fraud detection, and content recommendation. I led the creation of our MLOps engineering program (see it here) and contributed to our React-based presentation tool. I still teach for Decoded's biggest clients.
Teaching
London Business School
Programming Instructor
Remote | Jan to Mar-21
I worked at LBS as an external instructor teaching a series of 6 sessions in statistical programming with Python for a course titled, People Analytics. Here is an overview of the topics I taught in all 6 sessions,
- Intro to Python
- Regression Pt 1
- Regression Pt 2
- Panel Regression and Fixed Effects
- Regularization
- Intro to NLP and Topic Modeling
- Network Analysis
Coder Academy
Data Scientist & Educator
Sydney, AU | Sep-19 to Aug-21
At Coder Academy, I taught courses ranging from introduction to research to data analysis with Python, and with every course, I have been a student of my students as I always learn more from them than they do from me. My main responsibility though was to develop the first accredited Data Science Bootcamp outside of a university in Australia. Said bootcamp application for accreditation was submitted to TEQSA n July 2021. Here are the subjects that I worked on for the past 2 years.
- Programming for Data Analytics
- Advanced Statistics
- Data Engineering
- Machine Learning
- Cloud Computing, Data Ethics and Privacy
- Software Engineering for Data Professionals (ML in Production)
Johns Hopkins University
Lead Quantitative Tutor
Remote | Jan to Aug-19
As the Lead Quantitative Tutor, I guided students through R and Python programming while teaching advanced courses in experimental design, big data analytics, and machine learning. I developed comprehensive introductory programming courses tailored for graduate students of varying technical backgrounds, and created a structured onboarding manual to streamline the hiring and training process for new tutors joining our team.
Research
Research Engineer at INSEAD
Remote
During my tenure at INSEAD, I worked with a team of outstanding researchers to develop algorithms for analyzing company structures and culture. Our main project focused on generating hierarchical organizational structures (vertical or horizontal) by analyzing historical job advertisements from target companies and similar organizations. We leveraged various natural language processing techniques alongside supervised, semi-supervised, and unsupervised machine learning methods. Additionally, I developed a Python package implementing Latent Dirichlet Allocation to automatically extract and select meaningful topics from company reviews on platforms like Glassdoor. This package not only identified key themes but also provided quantitative cultural measures for organizations based on the discovered topics, offering valuable insights into organizational culture and structure.
Research Associate at Global Financial Literacy Excellence Center
Washington, DC, USA
At GFLEC, I led multiple research initiatives focused on retirement and financial education. I developed and deployed a comprehensive MTurk survey examining millennials' financial education seeking behaviors, while simultaneously conducting extensive literature reviews and expert interviews with 20 policymakers and retirement specialists to understand phased retirement programs. Using the Health and Retirement Study dataset, I built statistical models in Stata and R to analyze patterns between future workforce participation and baby boomers' gradual retirement trends, providing valuable insights into retirement planning and financial literacy.
Research Intern at Service Management Group
Kansas City, MO, USA
Created ArcGIS maps to visualize consumer behavior data, while concurrently conducting sentiment analysis using SPSS and Python, and orchestrating A/B tests to assess the efficacy of various data collection strategies like email, mail, and point-of-sale.
Technical Skills
Languages
Native Speaker: Python, English, JavaScript and Spanish
Intermediate: French and Rust