Get to know me

About Me

Jonah Zembower

Experience

  • Highmark Health — Data Scientist Intern (Summer 2026)
  • ARxChange — Data Analyst (2026)
  • Walmart ACC 7377 — Operations Management Analyst Intern
  • Brigham & Women's / Harvard Medical — Research Collaborator
  • Peak Performance Biomechanics — Biomechanical Data Specialist

Tech

  • Languages: Python, R, SQL, Java, JavaScript, Swift
  • ML / DL: Scikit-learn, XGBoost, PyTorch, TensorFlow, Hugging Face, Elastic Net
  • Data & Lakehouse: Pandas, NumPy, Tidyverse, BeautifulSoup, Databricks, Apache Spark SQL, Lakehouse Architecture
  • Streaming & MLOps: Apache Kafka, MLflow, DVC (Data Version Control), Prometheus, Grafana, Docker, Container Orchestration
  • Web / APIs: FastAPI, Django REST Framework, Web Scraping, Recommender Systems (Sentence-Transformers), LLMs
  • Cloud: AWS (S3, QuickSight, Quick Suite), Google BigQuery, Apple Developer
  • Viz & BI: Tableau, Looker Studio, Plotly, Matplotlib, Seaborn, Microsoft Power Apps
  • Storage: PostgreSQL, MongoDB, Microsoft SQL Server, MySQL, Parquet
  • Source / DevOps: Git, GitHub, Bitbucket

Undergraduate

  • BS Data Science — Computational Analytics
  • BS Exercise Science — Health / Fitness
  • DII Men's Soccer Player
  • Seton Hill University, Magna Cum Laude (May 2025)
  • Data Science Achievement Award (2023 & 2024)

Graduate

  • MS Health Care Analytics & IT — Carnegie Mellon, Heinz College (expected May 2027)
  • Honors: Fall 2025 Outstanding QPA (Heinz College)
  • Recent work: NFL Big Data Bowl 2026 (Heinz Sports Analytics), UN × Databricks Hackathon, USC Big Data Health Case Competition (MSK Shared Decision Making)
  • Activities & societies: Heinz Sports Analytics Club, Heinz AI Club, Data Science Club, NOVA AI Hackathon, CMU Hackathon, USC Big Data Health Science Case Competition, UN Databricks Hackathon
  • Coursework: Health Care Information Systems · Machine Learning with Python · Machine Learning in Production / AI Engineering · Optimization & Decision Modeling Analytics · Data-Focused Python · EDA & Data Visualizations · Database Management for Policy & Analytics · Health Systems · Health Policy · Applied Econometrics

Healthcare is no longer just about medicine — it’s about the data that guides it. I believe hidden within millions of data points are the insights needed to save lives, streamline care, and build a more accessible healthcare system. That belief is what drives everything I work on.

Currently, I’m turning this vision into practice as I pursue my Master of Science in Health Care Analytics & Information Technology at Carnegie Mellon University (expected May 2027). I pride myself on my adaptability — whether I’m architecting complex data pipelines, modeling clinical disease progression, or navigating the business strategy of healthcare IT, I seamlessly pivot between technical, clinical, and operational concepts.

Before CMU, I graduated Magna Cum Laude from Seton Hill University with dual degrees in Data Science and Exercise Science. At Walmart’s ACC 7377, I built a label-quality monitoring system with Power Automate and Power Apps that generated an estimated $6,000 in cost savings during its first week. Before that, I executed biomechanical and ergonomic analyses at Peak Performance Biomechanics, turning IMU and EMG data into client-ready reports.

This site is the working version of that journey. Browse the projects, research, and writing — and let’s learn more together.