Skip to content
View bmax16634's full-sized avatar

Block or report bmax16634

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bmax16634/README.md

Ben Maxwell

Data scientist and ML engineer focused on practical AI systems.

I focus on small-scale LLM training, evaluation, and shipping the full app around the model.

Featured — SoloLLM

A decoder-only GPT-style transformer designed and pretrained from scratch in PyTorch on a single RTX 3090.

  • Final 151.9M v3 model trained on a self-curated 10B-token dataset
  • Beats GPT-2 small overall on a fixed eval suite: held-out perplexity, WikiText-2, LAMBADA, and multiple-choice continuation scoring
  • Open source code, published Hugging Face model cards, and live completion demo

Code · Model collection · Final v3 model · Live demo

What I'm interested in

  • Small / efficient models that punch above their weight
  • Training tricks that matter on real hardware
  • Honest model evaluation and benchmark comparisons
  • End-to-end AI products — model in the middle, real product around it
  • Local-first AI tooling on consumer GPUs

Stack

  • Primary — Python, PyTorch, FastAPI, SQL
  • ML / Data — Hugging Face (Transformers, Datasets), NumPy, pandas, Polars, scikit-learn
  • Also — Node.js, R, Java, JavaScript, Streamlit
  • Environments — Linux, Docker, Git

Background

  • UT Austin — Post Graduate Program in Generative AI for Business Applications (Jan 2026)
  • Arizona State University — B.S. Data Science, summa cum laude (May 2024)
  • SmartRent — Workforce Management Analyst (forecasting, scheduling optimization, KPI pipelines)

Links

Pinned Loading

  1. bmax16634.github.io bmax16634.github.io Public

    JavaScript

  2. soloLLM soloLLM Public

    GPT-2-class language models trained from scratch in PyTorch on one RTX 3090, with 10B-token data curation and full GPT-2 comparisons.

    Python