donghaozhang

Researcher in Computer Vision, Natural Language Processing, Medical Image Analysis

Monash UniversityMelbourne

Pinned Repositories

3D_ESPNet
Efficient Segmentation for Volumetric Data
Language:Python0 3 00
Anisotropic_Fast_Marching
Language:MATLAB4 3 40
arbitrary-text-to-image-papers
A collection of arbitrary text to image papers with code (constantly updating)
1 2 00
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
00
Briain_Segmentation_BRATS17
Brain tumor segmentation for MICCAI 2017 BraTS challenge
Language:Python2 2 01
Fast_Segmentation
Semantic Segmentation Toys
Language:Python7 3 25
llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML00
R2Gen
Language:Python2 1 00
VTGAN
[ICCV2021] [Tensorflow] Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers
Language:Python2 1 06
rivuletpy
Robust 3D Neuron Tracing / General 3D tree structure extraction in Python for 3D images powered by the Rivulet2 algorithm. Pain-free Install & use in 5 mins.
Language:Python66 7 2016

donghaozhang's Repositories

donghaozhang/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
00
donghaozhang/aider
aider is AI pair programming in your terminal
donghaozhang/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
donghaozhang/AstrBot
✨易上手的多平台 LLM 聊天机器人及开发框架✨。支持 QQ、QQ频道、Telegram、微信个人号(Gewechat)、企业微信、飞书、内置 Web Chat，OpenAI GPT、DeepSeek、Ollama、Llama、GLM、Gemini、硅基流动、月之暗面、OneAPI、LLMTuner，支持 LLM Agent 插件开发，可视化面板。一键部署。支持 Dify 工作流、代码执行器、Whisper 语音转文字。
donghaozhang/deep-learning-pytorch-huggingface
donghaozhang/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
donghaozhang/eSearch
截屏离线OCR 搜索翻译以图搜图贴图录屏万向滚动截屏屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
donghaozhang/Leetcode_play
Language:Python
donghaozhang/LIMO
LIMO: Less is More for Reasoning
donghaozhang/lobe-ui
🍭 Lobe UI - an open-source UI component library for building AIGC web apps
donghaozhang/lossless-cut
The swiss army knife of lossless video/audio editing
donghaozhang/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
donghaozhang/markitdown
Python tool for converting files and office documents to Markdown.
donghaozhang/MedRAX
MedRAX: Medical Reasoning Agent for Chest X-ray
donghaozhang/Megatron-LM
Ongoing research training transformer models at scale
donghaozhang/node-DeepResearch
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
donghaozhang/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
donghaozhang/pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
donghaozhang/PDFMathTranslate
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/Docker/Zotero
donghaozhang/pippo
Pippo: High-Resolution Multi-View Humans from a Single Image
donghaozhang/quickemu
Quickly create and run optimised Windows, macOS and Linux virtual machines
donghaozhang/R1-V
Witness the aha moment of VLM with less than $3.
donghaozhang/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
donghaozhang/Sa2VA
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
donghaozhang/skyvern
Automate browser-based workflows with LLMs and Computer Vision
donghaozhang/smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
donghaozhang/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
donghaozhang/try-off-anyone
Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"
donghaozhang/verl
veRL: Volcano Engine Reinforcement Learning for LLM
donghaozhang/Zonos