Pinned Repositories
-Offer-
剑指Offer的AC代码
18369
18.369/8.315 - Mathematical Methods in Nanophotonics course
2021-3D-Object-Detection
This product identifies and labels 3D Objects in images of every day settings, such as cars, trees, bikes, pedestrians, etc. This product makes use of a UNet, which is a Convolutional Neural Network, to identify objects, given voxel data. Our product first takes point cloud data from the SemanticKITTI dataset, and converts it to voxels. For the sake of simplicity, a voxel can be described as a 3d pixel. We visualize these voxels as cubes, each cube containing spatial information in 3 dimensions.
2D-and-3D-face-alignment
This repository implements a demo of the networks described in "How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)" paper.
3d-human-overview
3d-human-reconstruction
3d-pedestrian-detection
TFG
faceJniDll
fcl
Flexible Collision Library
HCNetSDK-demo
海康网络摄像机SDK测试(抓拍,录像)
dfqytcom's Repositories
dfqytcom/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
dfqytcom/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
dfqytcom/chatgpt-on-wechat
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
dfqytcom/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
dfqytcom/codellama
Inference code for CodeLlama models
dfqytcom/EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
dfqytcom/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
dfqytcom/Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
dfqytcom/feishu-openai
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
dfqytcom/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion
dfqytcom/financial-chat
A financial chat application powered by Langchain, OpenBB, and Claude 3 Opus
dfqytcom/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
dfqytcom/images-that-sound
Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions
dfqytcom/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain
dfqytcom/lerobot
🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch
dfqytcom/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
dfqytcom/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
dfqytcom/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
dfqytcom/Mycobot_Tutorials
同济子豪兄大象机械臂Mycobot 280 Pi教程。机器人运动学、逆运动学、Python控制、ROS、具身智能。
dfqytcom/open-interpreter
A natural language interface for computers
dfqytcom/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
dfqytcom/openai-cookbook
Examples and guides for using the OpenAI API
dfqytcom/openai-python
The official Python library for the OpenAI API
dfqytcom/PaSCo
[CVPR 2024 Oral - Best paper award candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"
dfqytcom/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
dfqytcom/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
dfqytcom/seed-tts-eval
dfqytcom/streamv2v
Official Pytorch implementation of StreamV2V.
dfqytcom/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
dfqytcom/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs