pix2tex: Using a ViT to convert images of equations into LaTeX code.
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and stronger generalization, covering most usage scenarios.
📋 Python wrapper that grabs text from images and saves it as text files using the Tesseract engine
A math workspace for screenshot OCR, handwriting-to-LaTeX, office-plugin editing, and symbolic computation, powered by MathCraft OCR and MathLive.
Snap any image, screenshot, or webpage into plaintext. No GPU. No cloud. One command.
Local OCR server built on GLM-OCR (FastAPI + Web UI; supports images and PDFs)
Deep Extreme Cut (http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr): a tool for automatic object segmentation from extreme points.
Minimal local-first multimodal RAG library powered by SQLite + sqlite-vec.
A collection of scripts to "help" you with your programming exams and assignments.
Civitai Stable Diffusion 337k Dataset: a dataset of AI-generated images
A Large Language Model (LLM) Based App to Generate Stories from Pictures
TAO71 I4.0 is an AI created by TAO71 in Python.
Python tool that takes 1..n images, tries to rotate them based on the text, extracts the text, and stores the 1..n images in a PDF.
[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue
Run a trained im2txt model in inference mode
A web-based application that leverages the BLIP-2 model to generate detailed descriptions of uploaded images.