A collective list of free APIs
-
Updated
Jun 30, 2026 - Python
A collective list of free APIs
Faker is a Python package that generates fake data for you.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.
A MNIST-like fashion product database. Benchmark 👇
Open source annotation tool for machine learning practitioners.
Transformer: PyTorch Implementation of "Attention Is All You Need"
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.
A synthetic data generator for text recognition
Models, data loaders and abstractions for language processing, powered by PyTorch
Extract data from a wide range of Internet sources into a pandas DataFrame.
Community-maintained dataset of 700+ websites for finding accounts by username — powers OSINT and digital footprint tools.
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Colour Science for Python
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Add a description, image, and links to the dataset topic page so that developers can more easily learn about it.
To associate your repository with the dataset topic, visit your repo's landing page and select "manage topics."