mbijon's Repositories
mbijon/attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
mbijon/book-skeleton
Skeleton project for an Asciidoctor-based e-book
mbijon/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
mbijon/css-selector-tool
A low-code data extractor for websites with built in proxy and parsing capabilities. Great for testing and debugging css selectors
mbijon/docuseal
Open source DocuSign alternative. Create, fill, and sign digital documents ✍️
mbijon/finresearchdataset
mbijon/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
mbijon/fuzzdb
Dictionary of attack patterns and primitives for black-box application fault injection and resource discovery.
mbijon/javascript-heap-inspector
mbijon/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
mbijon/lost-pixel
Holistic visual testing for your Frontend 🖼 First class integration with Storybook, Ladle & other frontend libraries.
mbijon/MidJourney-Styles-and-Keywords-Reference
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
mbijon/min-dalle
min(DALL·E) is a minimal implementation of DALL·E Mini in PyTorch
mbijon/mup
maximal update parametrization (µP)
mbijon/node-ytdl-core
YouTube video downloader in javascript.
mbijon/openhaystack
Build your own 'AirTags' 🏷 today! Framework for tracking personal Bluetooth devices via Apple's massive Find My network.
mbijon/pgvectorscale
A complement to pgvector for high performance, cost efficient vector search on large workloads.
mbijon/pi-hole
A black hole for Internet advertisements
mbijon/portmaster
copy of safing.io/portmaster/
mbijon/pr-agent
🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
mbijon/puppeteer-heap-snapshot
API and CLI tool to fetch and query Chome DevTools heap snapshots.
mbijon/pywinassistant
The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.
mbijon/recruiter_rm
A small utility to send personalized responses to recruiters
mbijon/svg-gobbler
Open source browser extension for finding, editing, exporting, optimizing, and managing SVG content.
mbijon/terraspace
Terraspace: The Terraform Framework
mbijon/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
mbijon/The-Pragmatic-Programmer
Summary of the book The Pragmatic Programmer by Andrew Hunt and David Thomas
mbijon/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
mbijon/yoha
A practical hand tracking engine.
mbijon/youtube-dl
Command-line program to download videos from YouTube.com and other video sites