shigabeev

🥐

Ilya Shigabeev shigabeev

🥐

60 followers · 48 following

langswap.app
https://langswap.app

Achievements

Highlights

Stars

shotafujie / asrlance

「あすらんす」は音声認識性能を比較評価するツールです．音声ファイルパスと正解文を実行時に入力することで，認識精度（マイクロCER），処理にかかった時間，CPU使用率を結果として出力します．

Python 4 1 Updated Jun 22, 2026

sanghyang00 / ur-bert

Official implementation of the Interspeech 2026 paper: UR-BERT: Scaling Text Encoders for Massively Multilingual TTS Through Universal Romanization and Speech Token Prediction

Python 8 1 Updated Jun 17, 2026

adjacentai / cream-typer

🎙️ Voice translation in any direction. Locally on Apple Silicon. Tap Caps Lock, speak your language, get any other. Whisper.cpp, no cloud, no GPU rental.

Python 8 1 Updated May 2, 2026

herimor / voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

Python 241 30 Updated May 30, 2026

langswap-app / langswap

Self-hosted AI video dubbing with ASR, translation, voice cloning, subtitles, and local GPU inference.

Python 31 5 Updated Jun 22, 2026

albumentations-team / albu-spec

Python 84 Updated May 21, 2026

albumentations-team / benchmark

Python 94 3 Updated May 21, 2026

albumentations-team / albumentations_examples

Augmentations usage examples for albumentations library

Python 540 102 Updated Jun 12, 2026

albumentations-team / albucore

A high-performance image processing library designed to optimize and extend the Albumentations library with specialized functions for advanced image transformations. Perfect for developers working …

Python 121 11 Updated Jun 15, 2026

albumentations-team / AlbumentationsX

Next-generation Albumentations: dual-licensed for open-source and commercial use

Python 496 31 Updated Jun 29, 2026

Deep-unlearning / smol-audio

Practical, Colab-friendly notebooks for fine-tuning and running audio AI models

Jupyter Notebook 418 29 Updated May 19, 2026

IDRnD / redimnet

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 199 17 Updated Sep 24, 2025

debpalash / OmniVoice-Studio

The open-source ElevenLabs alternative for local voice cloning, design, create, dubbing and dictation Desktop App

Python 7,785 1,217 Updated Jul 1, 2026

Netflix / void-model

Python 1,918 179 Updated Jun 20, 2026

trapexit / mergerfs

a featureful union filesystem

C++ 5,715 218 Updated May 27, 2026

FujiwaraChoki / MoneyPrinterV2

Automate the process of making money online.

Python 31,123 3,357 Updated Jun 14, 2026

mp-web3 / jarvis-v3

Fully local voice interface for Claude Code on Apple Silicon. Parakeet STT + Kokoro TTS + SmartTurn EOU + dual VAD.

Python 30 2 Updated Mar 24, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 89,385 12,918 Updated Mar 26, 2026

kwatcharasupat / bandit-v2

Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"

Python 61 5 Updated Jul 29, 2025

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,544 267 Updated Jun 17, 2026

IS2AI / TurkicTTS

A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.

Python 84 9 Updated Aug 21, 2023

fishaudio / fish-speech

SOTA Open Source TTS

Python 31,066 2,655 Updated Jun 9, 2026

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 1,056 104 Updated Jun 17, 2026

stllfe / salem

small ass language [extendable with tools] model that follows instructions

Python 2 Updated Oct 16, 2025

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,780 732 Updated Jul 1, 2026

Den4ikAI / runorm

Простой нормализатор текстов перед синтезом речи

Python 47 4 Updated May 13, 2024

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 4,430 287 Updated Apr 8, 2026

Deathbyteacup / fluentbird

FluentBird is a userChrome.css theme for Mozilla Thunderbird, that implemenets Windows 11 Fluent Design and Mica transparency materials.

CSS 634 10 Updated Jun 11, 2026

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 4,467 380 Updated Dec 12, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,257 2,271 Updated Apr 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ilya Shigabeev shigabeev

Achievements

Achievements

Highlights

Block or report shigabeev

Stars

shotafujie / asrlance

sanghyang00 / ur-bert

adjacentai / cream-typer

herimor / voxtream

langswap-app / langswap

albumentations-team / albu-spec

albumentations-team / benchmark

albumentations-team / albumentations_examples

albumentations-team / albucore

albumentations-team / AlbumentationsX

Deep-unlearning / smol-audio

IDRnD / redimnet

debpalash / OmniVoice-Studio

Netflix / void-model

trapexit / mergerfs

FujiwaraChoki / MoneyPrinterV2

mp-web3 / jarvis-v3

karpathy / autoresearch

kwatcharasupat / bandit-v2

thu-ml / TurboDiffusion

IS2AI / TurkicTTS

fishaudio / fish-speech

XiaomiMiMo / MiMo-Audio

stllfe / salem

meta-pytorch / torchtune

Den4ikAI / runorm

openai / harmony

Deathbyteacup / fluentbird

fixie-ai / ultravox

GeeeekExplorer / nano-vllm