Pinned Repositories
Phi3V-Finetuning
Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.
imp
A family of highly capable yet efficient large multimodal models
openvqa
A lightweight, scalable, and general framework for visual question answering research
prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
ANNS
A project exploring approximate nearest neighbor search methods.
CosAttention2d
A 2D cosine attention module inspired by "cosFormer: Rethinking Softmax in Attention" (https://arxiv.org/abs/2202.08791)
Face_Cam_Exe
Build a face-recognition executable on Windows 10
LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
NodeGo
A Node.js Web Server for Go Game AI, powered by WGo.js, SabakiHQ/gtp and leela-zero.
Virtual_File_System
A simple file system, built as the course project for the HDU OS course.
ParadoxZW's Repositories
ParadoxZW/LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
ParadoxZW/NodeGo
A Node.js Web Server for Go Game AI, powered by WGo.js, SabakiHQ/gtp and leela-zero.
ParadoxZW/CosAttention2d
A 2D cosine attention module inspired by "cosFormer: Rethinking Softmax in Attention" (https://arxiv.org/abs/2202.08791)
ParadoxZW/NERD
NERD: Named Entity Representations for Disambiguation. National second prize, 2020 College Student Service Outsourcing Competition.
ParadoxZW/SamaritanHDU
A roll-call system using face recognition and the WeChat app platform.
ParadoxZW/Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models
ParadoxZW/prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
ParadoxZW/Automate-Anything-is-All-You-Need
ParadoxZW/ChuanhuChatGPT
GUI for ChatGPT API
ParadoxZW/cosFormer
Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention
ParadoxZW/cosformer-pytorch
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
ParadoxZW/Dotfile
ParadoxZW/fancy-and-tricky
remarkable snippets!
ParadoxZW/hexo-deploy-github-pages-action
🚀 GitHub action for deploying a Hexo project to GitHub pages.
ParadoxZW/image-processing-from-scratch
This project contains some interesting image processing algorithms written from scratch in Python and C++.
ParadoxZW/imp
Powerful multimodal small language models
ParadoxZW/LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
ParadoxZW/mmnas
Deep Multimodal Neural Architecture Search
ParadoxZW/mySIFT
course project
ParadoxZW/openvqa
A lightweight, scalable, and general framework for visual question answering research
ParadoxZW/ParadoxZW.github.io
ParadoxZW/PATexercise
ParadoxZW/Phi3V-Finetuning
Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.
ParadoxZW/PPOxFamily
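PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, straighten out the code logic, and master practical decision AI applications)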
ParadoxZW/shell_display.py
Display an image in the shell using 20 lines of Python code.
ParadoxZW/Sketch2Attributes
Predict the attributes of a human sketch
ParadoxZW/Test
Test some GitHub features
ParadoxZW/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ParadoxZW/Visualize_Tool
ParadoxZW/xmchat