Pinned Repositories
13th-Place-Solution-Digital-Green-Crop-Yield-Estimate-Challenge-
5K-Compliance
ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
AT-GCN
PyTorch implementation of the paper "Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network"
ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
audioset_tagging_cnn
audiossl
A library for easier audio self-supervised training and downstream task evaluation
autogen-experiments
automatic_prompt_engineer
hvt1609's Repositories
hvt1609/13th-Place-Solution-Digital-Green-Crop-Yield-Estimate-Challenge-
hvt1609/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
hvt1609/ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
hvt1609/audiossl
A library for easier audio self-supervised training and downstream task evaluation
hvt1609/autogen-experiments
hvt1609/automatic_prompt_engineer
hvt1609/awesome-generative-ai-guide
A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!
hvt1609/awesome-multiple-object-tracking
Resources for Multiple Object Tracking (MOT)
hvt1609/blood-vessel-segmentation
hvt1609/blood-vessel-segmentation-public
4th Place solution for SenNet + HOA - Hacking the Human Vasculature in 3D competition
hvt1609/ComfyUI-AdvancedLivePortrait
hvt1609/crop-yield-estimate
A machine learning solution to predict the per-acre yield of rice or wheat crops in India. The goal is to empower farmers and break the cycle of poverty and malnutrition.
hvt1609/Diffusion_models_tutorial
hvt1609/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
hvt1609/Dynamic-YOLO
hvt1609/FeatEng
The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, which requires domain knowledge in addition to a deep understanding of the underlying problem and data structure.
hvt1609/FourierGNN
Official implementation of the paper "FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective"
hvt1609/kagglebirdcall
Training code for the 6th-place solution to the Cornell Birdcall Identification Challenge
hvt1609/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
hvt1609/Noisy-ArcMix
Noisy-ArcMix: Additive Noisy Angular Margin Loss Combined With Mixup for Anomalous Sound Detection
hvt1609/OpenChallenge
hvt1609/openvino_notebooks
📚 Jupyter notebook tutorials for OpenVINO™
hvt1609/pose-bowl-spacecraft-challenge
Winning solutions from the Pose Bowl: Spacecraft Detection and Pose Estimation Challenge
hvt1609/rag-pdf-local-tutorial
An improved LangChain RAG tutorial (v2) with local LLMs, database updates, and testing.
hvt1609/RMB-CPI-Nowcasting-Challenge-Notes
Some thoughts on data selection, feature engineering, and model selection following the conclusion of the RMB CPI Nowcasting Challenge hosted on the Zindi platform, based on experience building the second-place model.
hvt1609/ScoreDiffusionModel
A PyTorch tutorial on score-based and diffusion models
hvt1609/sealion
South-East Asia Large Language Models
hvt1609/segment-vasculature-5th-place
3D segmentation of blood vessels based on Hierarchical Phase-Contrast Tomography (HiP-CT)
hvt1609/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
hvt1609/stable-audio-tools
Generative models for conditional audio generation