🏫
At school

Highlights

  • Pro

pufanyi/README.md

Pinned

  1. EvolvingLMMs-Lab/lmms-eval (Public)

    One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

    Python, 3.6k stars, 504 forks

  2. EvolvingLMMs-Lab/Otter (Public)

    🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

    Python, 3.3k stars, 209 forks

  3. EvolvingLMMs-Lab/lmms-engine (Public)

    A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

    Python, 706 stars, 31 forks

  4. OpenSenseNova/SenseNova-SI (Public)

    Scaling Spatial Intelligence with Multimodal Foundation Models

    Python, 160 stars, 8 forks

  5. EvolvingLMMs-Lab/VideoMMMU (Public)

    Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

    Python, 64 stars, 3 forks

  6. EvolvingLMMs-Lab/lmms-lab-writer (Public)

    Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing

    TypeScript 7