aimerou

AI Research Engineer

Baamtu, Senegal

aimerou's Stars

google-research/google-research
Google Research
Language: Jupyter Notebook 34.3k 753 1.3k 7.9k
ultralytics/ultralytics
Ultralytics YOLO11 🚀
Language: Python 32.7k 185 9.4k 6.3k
roboflow/supervision
We write your reusable computer vision tools. 💜
Language: Python 24.2k 162 4341.8k
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python 14.7k 118 9941.9k
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Language: Python 9.4k 77 1.5k 2.2k
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language: Python 7.7k 82 153762
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python 6.9k 54 2061.3k
mikel-brostrom/boxmot
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Language: Python 6.7k 60 1.1k 1.7k
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Language: Python 5.3k 63 74418
ifzhang/ByteTrack
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Language: Python 4.8k 44 371907
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Language: Python 4k 44 156359
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python 3.6k 65 104294
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python 3.5k 57 71304
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Language: Jupyter Notebook 1.5k 23 77231
ArjanCodes/betterpython
Code examples for my Write Better Python Code series on YouTube.
Language: Python 1.2k 57 3344
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Language: Jupyter Notebook 940 17 3642
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Language: Jupyter Notebook 937 13 101422
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language: Python 830 31 79157
dyhBUPT/StrongSORT
[TMM 2023] StrongSORT: Make DeepSORT Great Again
Language: Python 772 11 11778
google-deepmind/tracr
Language: Python 508 15 1343
ga642381/Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
103 11 25
getalp/ALFFA_PUBLIC
Language: Shell 48 10 398
lwang114/UnsupTTS
Language: Shell 36 8 34
khanld/Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining
Language: Python 33 2 03
alirezamshi/small100
Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to EMNLP 2022.
Language: Python 22 1 02
masakhane-io/masakhane-pos
POS for African languages
Language: Jupyter Notebook 17 1 019
uds-lsv/afro-maft
Language: Jupyter Notebook 16 7 05
neulab/AfricanVoices
Hosts text-to-speech corpus and speech synthesizers for African languages.
Language: Shell 13 2 12
Waxal-Multilingual/speech-data
This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models
Language: JavaScript 11 2 55
Waxal-Multilingual/audio-files
Audio files from waxal data collections
2 1 01
