hvt1609

Pinned Repositories

13th-Place-Solution-Digital-Green-Crop-Yield-Estimate-Challenge-
Language: Jupyter Notebook
5K-Compliance
Language: Python
ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Language: Python
AT-GCN
PyTorch implementation of the paper "Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network"
Language: Python
ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Language: Jupyter Notebook
audioset_tagging_cnn
Language: Python
audiossl
A library built for easier audio self-supervised training and downstream task evaluation
Language: Python
autogen-experiments
Language: Python
automatic_prompt_engineer
Language: Python

hvt1609's Repositories

hvt1609/13th-Place-Solution-Digital-Green-Crop-Yield-Estimate-Challenge-
Language: Jupyter Notebook
hvt1609/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Language: Python
hvt1609/ATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Language: Jupyter Notebook
hvt1609/audiossl
A library built for easier audio self-supervised training and downstream task evaluation
Language: Python
hvt1609/autogen-experiments
Language: Python
hvt1609/automatic_prompt_engineer
Language: Python
hvt1609/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
hvt1609/awesome-multiple-object-tracking
Resources for Multiple Object Tracking (MOT)
hvt1609/blood-vessel-segmentation
Language: Jupyter Notebook
hvt1609/blood-vessel-segmentation-public
4th Place solution for SenNet + HOA - Hacking the Human Vasculature in 3D competition
hvt1609/ComfyUI-AdvancedLivePortrait
hvt1609/crop-yield-estimate
A machine learning solution to predict the crop yield per acre of rice or wheat crops in India. The goal is to empower these farmers and break the cycle of poverty and malnutrition.
hvt1609/Diffusion_models_tutorial
hvt1609/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
hvt1609/Dynamic-YOLO
hvt1609/FeatEng
The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, which requires domain knowledge in addition to a deep understanding of the underlying problem and data structure.
hvt1609/FourierGNN
Official implementation of the paper "FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective"
hvt1609/kagglebirdcall
Training code of Cornell Birdcall Identification Challenge 6th place solution
Language: Python
hvt1609/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
hvt1609/Noisy-ArcMix
Noisy-ArcMix: Additive Noisy Angular Margin Loss Combined With Mixup for Anomalous Sound Detection
hvt1609/OpenChallenge
hvt1609/openvino_notebooks
📚 Jupyter notebook tutorials for OpenVINO™
hvt1609/pose-bowl-spacecraft-challenge
Winning solutions from the Pose Bowl: Spacecraft Detection and Pose Estimation Challenge
hvt1609/rag-pdf-local-tutorial
An Improved Langchain RAG Tutorial (v2) with local LLMs, database updates, and testing.
hvt1609/RMB-CPI-Nowcasting-Challenge-Notes
Some thoughts on data selection, feature engineering, and model selection following the conclusion of the RMB CPI Nowcasting Challenge hosted on the Zindi platform, based on experience building the second-place model.
hvt1609/ScoreDiffusionModel
A PyTorch tutorial on score-based and diffusion models
hvt1609/sealion
South-East Asia Large Language Models
hvt1609/segment-vasculature-5th-place
3D segmentation of blood vessels based on Hierarchical Phase-Contrast Tomography (HiP-CT)
hvt1609/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
hvt1609/stable-audio-tools
Generative models for conditional audio generation