UMI5751's Stars
jhao104/proxy_pool
Python ProxyPool for web spider
monosans/proxy-list
Lists of HTTP, SOCKS4, SOCKS5 proxies with geolocation info. Updated every hour.
TheSpeedX/PROXY-List
Get PROXY List that gets updated everyday
zaytoun/scihub.py
Python API and command-line tool for Sci-Hub
leovan/SciHubEVA
A Cross Platform Sci-Hub GUI Application
dougy147/scitopdf
Quickly fetch and pop scientific papers.
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery ๐งโ๐ฌ
evil-huawei/evil-huawei
Evil Huawei - ๅไธบไฝ่ฟ็ๆถ
decodingml/llm-twin-course
๐ค ๐๐ฒ๐ฎ๐ฟ๐ป for ๐ณ๐ฟ๐ฒ๐ฒ how to ๐ฏ๐๐ถ๐น๐ฑ an end-to-end ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป-๐ฟ๐ฒ๐ฎ๐ฑ๐ ๐๐๐ & ๐ฅ๐๐ ๐๐๐๐๐ฒ๐บ using ๐๐๐ ๐ข๐ฝ๐ best practices: ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + 12 ๐ฉ๐ข๐ฏ๐ฅ๐ด-๐ฐ๐ฏ ๐ญ๐ฆ๐ด๐ด๐ฐ๐ฏ๐ด
datawhalechina/llm-cookbook
้ขๅๅผๅ่ ็ LLM ๅ ฅ้จๆ็จ๏ผๅดๆฉ่พพๅคงๆจกๅ็ณปๅ่ฏพ็จไธญๆ็
apify/crawlee-python
CrawleeโA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee
CrawleeโA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
scholarly-python-package/scholarly
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
MarkHershey/arxiv-dl
Command-line Paper Downloader for ArXiv, CVF (CVPR, ICCV, WACV) & ECVA (ECCV)
suqingdong/scihub
PDF Downloader with SCI-HUB
adithya-s-k/omniparse
Ingest, parse, and optimize any data format โก๏ธ from documents to multimedia โก๏ธ for enhanced compatibility with GenAI frameworks
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Textualize/rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
BuilderIO/micro-agent
An AI agent that writes (actually useful) code for you
ArronAI007/Awesome-AGI
AGI่ตๆๆฑๆปๅญฆไน ๏ผไธป่ฆๅ ๆฌLLMๅAIGC๏ผ๏ผๆ็ปญๆดๆฐ......
arnabsen1729/Website-to-PDF
App that converts a website to a PDF
CraftJarvis/RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
huggingface/peft
๐ค PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! ๐ฆฅ
AgentRev/WindowsAppsUnfukker
PowerShell script to fix WindowsApps-related permission errors and crashes.
Doriandarko/maestro
A framework for Claude Opus to intelligently orchestrate subagents.
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. [โ ๏ธ DEVIKA DOES NOT HAVE AN OFFICIAL WEBSITE โ ๏ธ]
All-Hands-AI/OpenHands
๐ OpenHands: Code Less, Make More
clash-verge-rev/clash-verge-rev
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.