edameral's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI đź”— https://microsoft.github.io/generative-ai-for-beginners/
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
mnielsen/neural-networks-and-deep-learning
Code samples for my book "Neural Networks and Deep Learning"
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
musikalkemist/AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"
superlinked/superlinked
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
google-research-datasets/conceptual-captions
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.
scopeInfinity/Video2Description
Video to Text: Natural language description generator for some given video. [Video Captioning]
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Thiagohgl/ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation.
JaywongWang/DenseVideoCaptioning
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2018, with code, model and prediction results.
jpthu17/DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
atilsamancioglu/PromptEngineeringCourse
jpthu17/HBI
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
ttengwang/dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
WingsBrokenAngel/Semantics-AssistedVideoCaptioning
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
jpthu17/DiCoSA
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
MichiganCOG/Video-Grounding-from-Text
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
acherstyx/CoCap
[ICCV 2023] Accurate and Fast Compressed Video Captioning
TarikKaanKoc/prompt-engineering
Few-Shot Prompting - Chain-of-Thought (CoT) Prompting - Hallucinations - Self-Consistency - Generated Knowledge Prompting - Tree of Thoughts (ToT) - Summarization - Sentiment Analysis - Entity and Keyword Detection - Inferring Specifications - Intent Detection - Constitutional Prompting - Jailbreaking - Promp
ahmethaydarornek/transfer_learning_tensorflow_keras
Transfer Learning is used to classify images with high performance.
zohrehghaderi/VASTA
A Video-to-Text Framework
yavuzKomecoglu/divaconf2024_assistant_demo
Atölye - Create Your Own Data-Connected Assistant | Diva: Dive into AI Konferansı 13 Temmuz 2024
deepgram-devs/prerecorded-audio-notebook
A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.
canerskrc/Books
Data Science Book
canerskrc/Machine-Learning-101-
willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
erimdogan/MLBOOTCAMP_FINAL_PROJECT
kalpit07/V-Concise
Video Captioning using Seq2Seq Model