Pinned Repositories
alex-ht
Config files for my GitHub profile.
alpine-opencv-docker
Pre-built OpenCV for armhf Alpine Linux 3.6
apt-cyg
Apt-cyg, an apt-get like tool for Cygwin
asr-dev-dockerfile
aurora2_egs
example scripts for AURORA2
aurora4_egs
example scripts for AURORA4
BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json
bypass
Bypass domain, CIDR list. Block domain list.
options-segmenter
scripts to build a keyword-filler based recognizer for four-option single choice question speech segmentation.
timit-nnet3
TIMIT,用nnet3-tdnn腳本(multi-splice)
alex-ht's Repositories
alex-ht/alex-ht
Config files for my GitHub profile.
alex-ht/BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json
alex-ht/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
alex-ht/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
alex-ht/common
Common source, scripts and utilities shared across all Triton repositories.
alex-ht/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
alex-ht/data_tooling
Tools for managing datasets for governance and training.
alex-ht/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
alex-ht/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
alex-ht/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
alex-ht/DeepSpeedExamples
Example models using DeepSpeed
alex-ht/DifferentiableBinarization
DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow
alex-ht/evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
alex-ht/flash-attention
Fast and memory-efficient exact attention
alex-ht/GMAN
GMAN: A Graph Multi-Attention Network for Traffic Prediction (GMAN, https://fanxlxmu.github.io/publication/aaai2020/) was accepted by AAAI-2020.
alex-ht/googlesearch
A Python library for scraping the Google search engine.
alex-ht/k2chain
alex-ht/langchain
⚡ Building applications with LLMs through composability ⚡
alex-ht/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
alex-ht/minio-cpp
MinIO C++ Client SDK for Amazon S3 Compatible Cloud Storage
alex-ht/mistral-common
alex-ht/nemo_cp_debug
alex-ht/olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
alex-ht/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
alex-ht/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
alex-ht/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
alex-ht/t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
alex-ht/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
alex-ht/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
alex-ht/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors