donghaozhang
Researcher in Computer Vision, Natural Language Processing, Medical Image Analysis
Monash UniversityMelbourne
Pinned Repositories
3D_ESPNet
Efficient Segmentation for Volumetric Data
Anisotropic_Fast_Marching
arbitrary-text-to-image-papers
A collection of arbitrary text to image papers with code (constantly updating)
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Briain_Segmentation_BRATS17
Brain tumor segmentation for MICCAI 2017 BraTS challenge
Fast_Segmentation
Semantic Segmentation Toys
llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
R2Gen
VTGAN
[ICCV2021] [Tensorflow] Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers
rivuletpy
Robust 3D Neuron Tracing / General 3D tree structure extraction in Python for 3D images powered by the Rivulet2 algorithm. Pain-free Install & use in 5 mins.
donghaozhang's Repositories
donghaozhang/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
donghaozhang/aider
aider is AI pair programming in your terminal
donghaozhang/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
donghaozhang/AstrBot
✨易上手的多平台 LLM 聊天机器人及开发框架✨。支持 QQ、QQ频道、Telegram、微信个人号(Gewechat)、企业微信、飞书、内置 Web Chat,OpenAI GPT、DeepSeek、Ollama、Llama、GLM、Gemini、硅基流动、月之暗面、OneAPI、LLMTuner,支持 LLM Agent 插件开发,可视化面板。一键部署。支持 Dify 工作流、代码执行器、Whisper 语音转文字。
donghaozhang/deep-learning-pytorch-huggingface
donghaozhang/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
donghaozhang/eSearch
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
donghaozhang/Leetcode_play
donghaozhang/LIMO
LIMO: Less is More for Reasoning
donghaozhang/lobe-ui
🍭 Lobe UI - an open-source UI component library for building AIGC web apps
donghaozhang/lossless-cut
The swiss army knife of lossless video/audio editing
donghaozhang/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
donghaozhang/markitdown
Python tool for converting files and office documents to Markdown.
donghaozhang/MedRAX
MedRAX: Medical Reasoning Agent for Chest X-ray
donghaozhang/Megatron-LM
Ongoing research training transformer models at scale
donghaozhang/node-DeepResearch
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
donghaozhang/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
donghaozhang/pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
donghaozhang/PDFMathTranslate
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
donghaozhang/pippo
Pippo: High-Resolution Multi-View Humans from a Single Image
donghaozhang/quickemu
Quickly create and run optimised Windows, macOS and Linux virtual machines
donghaozhang/R1-V
Witness the aha moment of VLM with less than $3.
donghaozhang/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
donghaozhang/Sa2VA
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
donghaozhang/skyvern
Automate browser-based workflows with LLMs and Computer Vision
donghaozhang/smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
donghaozhang/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
donghaozhang/try-off-anyone
Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"
donghaozhang/verl
veRL: Volcano Engine Reinforcement Learning for LLM
donghaozhang/Zonos