web-search

There are 165 repositories under web-search topic.

  • firecrawl

    firecrawl/firecrawl

    🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

    Language:TypeScript66.9k2577535.2k
  • ScrapeGraphAI/Scrapegraph-ai

    Python scraper based on AI

    Language:Python21.7k1354151.9k
  • InternLM/MindSearch

    🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

    Language:JavaScript6.7k47208667
  • jarun/googler

    :mag: Google from the terminal

    Language:Python6.2k155219532
  • jarun/ddgr

    :duck: DuckDuckGo from the terminal

    Language:Python3.1k57124150
  • felladrin/awesome-ai-web-search

    List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search

    Language:HTML1.1k172581
  • ict-bigdatalab/awesome-pretrained-models-for-information-retrieval

    A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

  • MiniSearch

    felladrin/MiniSearch

    Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

    Language:TypeScript48983954
  • VIDA-NYU/ache

    ACHE is a web crawler for domain-specific search.

    Language:Java47533144135
  • Kurama622/llm.nvim

    A large language model (LLM) support for Neovim, provides commands to interact with LLM (like ChatGPT, ChatGLM, kimi, deepseek, openrouter and local llms). Support Github models.

    Language:Lua40635631
  • iniwap/AIWriteX

    AIWriteX是基于CrewAI、AIForge的新一代智能内容创作平台,从微信公众号自动化工具起步,正在重新定义AI辅助内容创作的边界,融合"搜索+借鉴+AI+创意"四重能力,多种超绝玩法,内容创作充满无限可能。

    Language:Python33589
  • mrkrsl/web-search-mcp

    A simple, locally hosted Web Search MCP server for use with Local LLMs

    Language:TypeScript2891831
  • BigSearch

    garywill/BigSearch

    Browser extension. Definitly more than a GET/POST sender. Handily use search engines via a Flexible Tool! UI has Vimium-like feature 🌐🔍 (Pure-client. No 3rd-party server needed) 大术专搜 既专又广 手敲几下 纵横去往

    Language:JavaScript26452217
  • armindarvish/consult-omni

    A Powerful Versatile Omni Search inside Emacs

    Language:Emacs Lisp2521279
  • jgravelle/pocketgroq

    PocketGroq is a powerful Python library that simplifies integration with the Groq API, offering advanced features for natural language processing, web scraping, and autonomous agent capabilities. Key Features Seamless integration with Groq API for text generation and completion Chain of Thought (CoT) reasoning for complex problem-solving and more.

    Language:Python20891058
  • AstraBert/llama-4-researcher

    Turn topics into essays in seconds!

    Language:Python1912528
  • chenwr727/yuanbao-free-api

    YuanBao-Free-API 是一个允许您通过 OpenAI 兼容接口访问腾讯元宝的服务。

    Language:Python19153253
  • CursorTouch/Web-Navigator

    Web-Navigator is an agent for web browsing and scraping websites.

    Language:Python12821529
  • jingtaozhan/DRhard

    SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.

    Language:Python12842515
  • cnzzx/VSA

    Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

    Language:Python1262510
  • findto

    lucasm/findto

    🔴🟠🔵🟢🟡🟣 Findto is an open source decentralized search assistant. Search on AI, Web, and more. Be free.

    Language:TypeScript95759
  • armindarvish/consult-web

    Powerful Web and Omni Search inside Emacs

    Language:Emacs Lisp94376
  • mikechao/brave-search-mcp

    An MCP Server implementation that integrates the Brave Search API, providing, Web Search, Local Points of Interest Search, Image Search, Video Search and News Search capabilities

    Language:TypeScript8616
  • capjamesg/jamesql

    An in-memory NoSQL database implemented in Python.

    Language:Python83001
  • crawlzone/crawlzone

    Crawlzone is a fast asynchronous internet crawling framework for PHP.

    Language:PHP81111110
  • The-Osint-Toolbox/Search-Engines

    A list of Search Engines that will be useful for different aspect of your work, OSINT, Privacy & OPSEC.

  • sazonovanton/SirChatalot

    SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities, tools and semantic search in vector DB.

    Language:Python724814
  • jingtaozhan/RepBERT-Index

    RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddings. The inner products of them are regarded as relevance scores. Its efficiency is comparable to bag-of-words methods.

    Language:Python663710
  • trovu/trovu

    Search 1000+ websites in a command-line way, with curated and personal shortcuts, organized by namespaces, allowing multiple and typed arguments, with maximum privacy.

    Language:TypeScript66226518
  • tkattkat/google-search-scraper

    TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for web scraping and data collection.

    Language:TypeScript5613
  • jingtaozhan/JPQ

    CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.

    Language:Python531411
  • VIDA-NYU/domain_discovery_tool

    This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.

    Language:JavaScript47155612
  • octopalm

    eddiegulay/octopalm

    OctoPalm.js is a lightweight JavaScript library designed to add real-time, customizable search functionality to your web applications. It provides a seamless search experience with animated results and custom-styled scrollbars, making it a robust solution for enhancing search features on your site.

    Language:JavaScript46121
  • jalpp/recursearch

    A MAS that searches the web recursively to generate comprehensive research reports with or without citations.

    Language:TypeScript425
  • PlustOrg/search-sdk

    Easily use and switch between different web search API providers in TypeScript with a single, unified interface.

    Language:TypeScript23
  • sgaunet/perplexity-go

    A comprehensive Go client library for the Perplexity AI API with support for chat completions, async jobs, streaming, multimodal messages, structured outputs, and web search integration

    Language:Go221406