Pinned Repositories
annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, XL, Switch, Feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (PPO, DQN), CapsNet, distillation, ... 🧠
Awesome-Multimodal-Large-Language-Models
✨✨ Latest advances on multimodal large language models
Bert-Chinese-Text-Classification-Pytorch
Chinese text classification using BERT and ERNIE
BERT-chinese-text-classification-pytorch-1
This repo contains a PyTorch implementation of a pretrained BERT model for text classification.
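Both BERT classification repos above follow the same fine-tuning pattern; a minimal sketch with the Hugging Face transformers API (the repos' own training loops may differ, and `num_labels=10` is an illustrative placeholder):

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Load a Chinese BERT checkpoint; the label count is illustrative.
tokenizer = AutoTokenizer.from_pretrained("bert-base-chinese")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-chinese", num_labels=10
)

inputs = tokenizer("这部电影非常好看", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.argmax(dim=-1))  # predicted class index (untrained head here)
```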
blog
Public repo for HF blog posts
caffe
Caffe: a fast open framework for deep learning.
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
CV
✔ (Completed) Comprehensive deep learning notes [Tudui's PyTorch tutorial] [Mu Li's Dive into Deep Learning] [Andrew Ng's Deep Learning]
NLPer-Interview
This repository mainly collects interview questions for NLP algorithm engineers
video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
lucas0214's Repositories
lucas0214/CV
✔ (Completed) Comprehensive deep learning notes [Tudui's PyTorch tutorial] [Mu Li's Dive into Deep Learning] [Andrew Ng's Deep Learning]
lucas0214/annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, XL, Switch, Feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (PPO, DQN), CapsNet, distillation, ... 🧠
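As a taste of what these annotated implementations cover, here is a minimal scaled dot-product attention in PyTorch — an illustrative sketch of the core transformer operation, not the repo's exact code:

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    """Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V."""
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

q = k = v = torch.randn(2, 8, 64)  # (batch, seq_len, d_k)
out = scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([2, 8, 64])
```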
lucas0214/Awesome-Multimodal-Large-Language-Models
✨✨ Latest advances on multimodal large language models
lucas0214/Bert-Chinese-Text-Classification-Pytorch
Chinese text classification using BERT and ERNIE
lucas0214/blog
Public repo for HF blog posts
lucas0214/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
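A minimal zero-shot sketch using OpenAI's clip package as documented in this repo's README; the image path and candidate labels are placeholders:

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("photo.jpg")).unsqueeze(0).to(device)  # placeholder path
text = clip.tokenize(["a dog", "a cat", "a car"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1)
print(probs)  # relevance of each text snippet to the image
```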
lucas0214/Clip4caption
A Simple Baseline Based on CLIP and Transformer for Video Captioning
lucas0214/CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects
lucas0214/d2l-zh
Dive into Deep Learning: written for Chinese readers, runnable, and open for discussion. The Chinese and English editions are used for teaching at over 500 universities in more than 70 countries.
lucas0214/datasets
🤗 The largest hub of ready-to-use datasets for ML models, with fast, easy-to-use, and efficient data manipulation tools
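A minimal sketch of the library's core workflow (the dataset choice and added column are illustrative):

```python
from datasets import load_dataset

# Stream any Hub dataset into memory-mapped Arrow tables.
ds = load_dataset("imdb", split="train")
print(ds[0]["text"][:80], ds[0]["label"])

# map() applies a transform over the whole dataset with caching.
ds = ds.map(lambda ex: {"n_words": len(ex["text"].split())})
print(ds.column_names)
```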
lucas0214/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
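A minimal text-to-image sketch with the diffusers pipeline API; the checkpoint id and prompt are illustrative, and any text-to-image model on the Hub would work:

```python
import torch
from diffusers import DiffusionPipeline

# Download a pretrained diffusion pipeline (model id is illustrative).
pipe = DiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")

image = pipe("an astronaut riding a horse").images[0]
image.save("astronaut.png")
```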
lucas0214/doing_the_PhD
lucas0214/LLM-quickstart
Quick Start for Large Language Models (Theoretical Learning and Practical Fine-tuning)
lucas0214/llm-viz
3D visualization of a GPT-style LLM
lucas0214/LLMBind
LLMBind: A Unified Modality-Task Integration Framework
lucas0214/MA-LMM
(CVPR 2024) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
lucas0214/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
lucas0214/MPIKGC
[LREC-COLING 2024] Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models
lucas0214/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
lucas0214/openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
lucas0214/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
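A minimal LoRA sketch with the peft API — the base model and hyperparameters are illustrative, not a recommended configuration:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")  # base model is illustrative

# LoRA trains small low-rank adapter matrices instead of all weights.
config = LoraConfig(r=8, lora_alpha=16, target_modules=["c_attn"], lora_dropout=0.05)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only a fraction of a percent is trainable
```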
lucas0214/ProLLaMA
A Protein Large Language Model for Multi-Task Protein Language Processing
lucas0214/ProST
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval (ICCV 2023 Oral)
lucas0214/SceneGraphParser
A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
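A minimal sketch assuming the sng_parser entry point from this repo's README; the sentence is a placeholder:

```python
import sng_parser

# Parse a caption into entities and relations (a symbolic scene graph).
graph = sng_parser.parse("A woman is playing the piano in the room.")
sng_parser.tprint(graph)  # pretty-print the graph as a table
print(graph["entities"], graph["relations"])
```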
lucas0214/Transformer-from-scratch
lucas0214/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
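A minimal sketch of the library's highest-level entry point; the task and default checkpoint it pulls are illustrative:

```python
from transformers import pipeline

# pipeline() bundles tokenizer + model + post-processing for a task.
classifier = pipeline("sentiment-analysis")
print(classifier("This library makes NLP experiments painless."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```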
lucas0214/UniBind
The source code for "UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All"
lucas0214/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
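A sketch of the load-and-adapt pattern from unsloth's README, under the assumption that its FastLanguageModel API matches; the model name and LoRA settings are illustrative:

```python
from unsloth import FastLanguageModel

# 4-bit loading plus LoRA adapters is where the speed/memory savings come from.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",  # illustrative checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```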
lucas0214/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
lucas0214/vit-imagenet21k-p
Preprocessing ImageNet-21K into an indexed directory, with a Vision Transformer (ViT) trained on it, inspired by 'Pretraining for the Masses'.