MissPenguin

49 followers · 2 following

Stars

Yuliang-Liu / Monkey

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,947 139 Updated Jan 24, 2026

OpenBMB / InfiniteBench

Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 370 30 Updated Sep 25, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,390 4,775 Updated Jun 2, 2025

SkyworkAI / Skywork

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,473 144 Updated Mar 7, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 15,088 3,550 Updated Feb 1, 2026

OpenLMLab / MOSS-RLHF

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,417 105 Updated Mar 3, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,979 2,216 Updated Jul 29, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,265 4,017 Updated Jul 17, 2024

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.

Python 9,513 701 Updated Jan 24, 2026

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,800 253 Updated Dec 12, 2023

mli / paper-reading

Paragraph-by-paragraph close readings of classic and new deep learning papers

32,505 2,774 Updated Mar 22, 2025

kaixindelele / ChatPaper

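Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.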

Python 19,236 1,951 Updated Nov 19, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

26,151 2,282 Updated Jul 31, 2025

HqWu-HITCS / Awesome-Chinese-LLM

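A collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.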

22,178 2,103 Updated May 19, 2025

datawhalechina / llm-cookbook

An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large-model course series

Jupyter Notebook 23,159 2,817 Updated Jun 12, 2025

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models

Python 5,514 512 Updated Dec 14, 2025

binary-husky / gpt_academic

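Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…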

Python 70,057 8,406 Updated Jan 25, 2026

meta-llama / llama

Inference code for Llama models

Python 59,098 9,824 Updated Jan 26, 2025

zai-org / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,219 5,210 Updated Jun 27, 2024

mayooear / ai-pdf-chatbot-langchain

AI PDF chatbot agent built with LangChain & LangGraph

TypeScript 16,337 3,233 Updated Feb 20, 2025

DSXiangLi / DecryptPrompt

A summary of prompt and LLM papers, open-source data and models, and AIGC applications

3,343 321 Updated Jan 19, 2026

yeungchenwa / OCR-SAM

[Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text…

Python 580 41 Updated Jan 30, 2024

ORDINAND / The-Art-of-Asking-ChatGPT-for-High-Quality-Answers-A-complete-Guide-to-Prompt-Engineering-Technique

Tips for asking ChatGPT questions

1,007 120 Updated Mar 21, 2023

FudanVI / benchmarking-chinese-text-recognition

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Python 502 51 Updated Dec 2, 2022

PaddlePaddle / FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

Python 3,637 690 Updated Jan 31, 2026

PaddlePaddle / PaddleFormers

PaddleFormers is an easy-to-use library and model zoo of pre-trained large language models based on PaddlePaddle.

Python 12,950 2,168 Updated Feb 1, 2026

ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.

Python 3,287 471 Updated May 28, 2025

GreatV / optlab

OCR pre-processing Toolbox

C++ 18 3 Updated Nov 29, 2022

RangeKing / OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

Python 33 5 Updated Dec 21, 2022

telppa / PaddleOCR-AutoHotkey

PaddleOCR AutoHotkey version.

AutoHotkey 161 21 Updated Sep 9, 2025
