bmax16634

Follow

Ben Maxwell bmax16634

Follow

Data scientist / ML engineer. Python · PyTorch · FastAPI. UT Austin Gen-AI · ASU Data Science.

1 follower · 0 following

bmax16634/README.md

Ben Maxwell

Data scientist and ML engineer focused on practical AI systems.

I focus on small-scale LLM training, evaluation, and shipping the full app around the model.

Featured — SoloLLM

A decoder-only GPT-style transformer designed and pretrained from scratch in PyTorch on a single RTX 3090.

Final 151.9M v3 model trained on a self-curated 10B-token dataset
Beats GPT-2 small overall on a fixed eval suite: held-out perplexity, WikiText-2, LAMBADA, and multiple-choice continuation scoring
Open source code, published Hugging Face model cards, and live completion demo

Code · Model collection · Final v3 model · Live demo

What I'm interested in

Small / efficient models that punch above their weight
Training tricks that matter on real hardware
Honest model evaluation and benchmark comparisons
End-to-end AI products — model in the middle, real product around it
Local-first AI tooling on consumer GPUs

Stack

Primary — Python, PyTorch, FastAPI, SQL
ML / Data — Hugging Face (Transformers, Datasets), NumPy, pandas, Polars, scikit-learn
Also — Node.js, R, Java, JavaScript, Streamlit
Environments — Linux, Docker, Git

Background

UT Austin — Post Graduate Program in Generative AI for Business Applications (Jan 2026)
Arizona State University — B.S. Data Science, summa cum laude (May 2024)
SmartRent — Workforce Management Analyst (forecasting, scheduling optimization, KPI pipelines)

Links

Portfolio — bmax16634.github.io
Hugging Face — @bmax16634

Pinned Loading

bmax16634.github.io bmax16634.github.io Public

JavaScript
soloLLM soloLLM Public

GPT-2-class language models trained from scratch in PyTorch on one RTX 3090, with 10B-token data curation and full GPT-2 comparisons.

Python