url-extractor

There are 32 repositories under url-extractor topic.

  • patternhelloworld/url-knife

    Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity

    Language:TypeScript3553315
  • galer

    dwisiswant0/galer

    A fast tool to fetch URLs from HTML attributes by crawl-in.

    Language:Go25961536
  • r3dxpl0it/Damn-Small-URL-Crawler

    A Minimal Yet Powerful Crawler for Extracting all The Internal/External/Fuzz-able Links from a website

    Language:Python330111
  • sp34rh34d/WebRunner

    Web scraping | Website cloner | Path Traversal Scanner

    Language:Python15104
  • eversinc33/urlExtract

    Recursively extract urls from a web page for reconnaissance.

    Language:Go12213
  • httpreserve/tikalinkextract

    Tika based link (URL) extractor for httpreserve

    Language:HTML10390
  • hueristiq/xurls

    A command-line utility for working with URLs.

    Language:Go9201
  • superbeppe98/ebay-listing-matches

    The eBay Listing Matcher is a Python script designed to compare and match eBay listings with parts from an Inventree instance. This script utilizes the eBay Trading API and the Inventree API to gather and process data.

    Language:Python7101
  • hrbrmstr/drill-twitter-text

    An Apache Drill UDF for working with Twitter tweet text via the twitter-text Java library (https://github.com/twitter/twitter-text/tree/master/java)

    Language:Java5203
  • limit-zero/url-juicer

    🍊🔗 Squeeze some juice from URLs: A URL crawler/extraction library.

    Language:JavaScript5300
  • woodenphone/UrlSort

    Tool for extracting and sorting links from a text file.

    Language:Python3200
  • akash-rajak/URL-Extractor

    A python script to extract URL from the text or paragraph.

    Language:Python2101
  • dotdesh71/Website-URL-Scanner

    Website URL Scanner is a simple command-line tool that allows you to scan a website and extract all URLs. It can be useful for various purposes, such as link analysis or checking for broken links.

    Language:Batchfile2100
  • hobsRKM/article-miner

    Extract article title, description, images, keywords and authors from any URL

    Language:JavaScript2100
  • john-carroll-sw/get-item-image

    This repository demonstrates how to use OpenAI's Response API (with GPT-4.1 and tool calling) to extract the main product image URL from an e-commerce product page. It provides both Python and TypeScript implementations, returning a structured output for easy integration.

    Language:Python200
  • khalidelboray/SrcSearch

    Extract URLs,endpoints,paths and word-lists form source files

    Language:Raku2001
  • mortyobnoxious/kickvod-extractor

    extract master url of a kick vod

    Language:HTML20
  • OpenBookPublishers/geturls

    Extact all URLs from anchor and image tags within a html/xhtml page and its children.

    Language:Shell2502
  • SeyedMahmoudMousavi/url-extractor

    Extract urls from your a file or web address

    Language:PHP2200
  • aiwebautomation/anytext2images

    Any blob of text → Extract the images → Preview & download = Fast

    Language:Python1
  • gl0bal01/bookmarklets

    Curated Bookmarks for OSINT & Investigations

    Language:JavaScript1
  • hueristiq/hq-go-url

    A Go (Golang) package for working with URLs. It provides robust tools for extracting URLs from text and parsing them into granular components.

    Language:Go1100
  • Ishanoshada/ApacheStatusExtractor

    Python tool to extract PID, Method, VHost, and Request from Apache server-status pages. Supports single and bulk URL checks for security analysis

    Language:Python1
  • KOUISAmine/url-extractor

    Extract http/https URLs from any kind of text content.

    Language:HTML110
  • loneicewolf/RE_QUERY

    A tool I use to search the web efficiently

  • ryanaryap/URL-Extractor

    URL Extractor is a simple Python code designed to extract the domain name from a list of URLs stored in a text file. This application provides a convenient way to extract and process URLs efficiently.

    Language:Python1100
  • zedseven/urlextractor

    A small tool for extracting all urls from a blob of binary data (ex. PDFs).

    Language:Go1101
  • manyfacedllama/email-attachment-url-extractor

    File attachment and URL extractor for EML & MSG files using Python

    Language:Python0100
  • trixxmanaty/extract-urls-to-file

    This project contains a Python script to extract all unique absolute URLs from a webpage and write them into a text file. This can be useful for indexing purposes.

    Language:Python0100
  • AlteredAdmin/Link-Lifter

    LinkLifter is a Python script that searches for URLs in a given text file or recursively in a directory and its subdirectories. The found URLs, along with the file they are located in, are saved to a CSV file.

    Language:Python10
  • jessicasimplicio/mD-links-lib

    Bootcamp Laboratoria - Produto final do sprint 4. Biblioteca no npm para extracao de links em documento markdown.

    Language:JavaScript10
  • superbeppe98/url-title-extractor

    URL Title Extractor is a Python program that extracts the titles of web pages from a file containing URLs. It uses the requests and BeautifulSoup libraries to extract the title and decode any HTML entities.

    Language:Python10