news-extractor

There are 9 repositories under news-extractor topic.

  • news-please

    fhamborg/news-please

    news-please - an integrated web crawler and information extractor for news that just works

    Language:Python2k54180422
  • SkywalkerDarren/chatWeb

    ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

    Language:Python8762015136
  • currentslab/extractnet

    A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one package

    Language:HTML21761521
  • kwaziidev/textractor

    从html中提取正文,用于新闻类网页

    Language:Go15214
  • Tuhin-SnapD/News-Extractor-Summarizer

    The Python-based web app extracts and summarizes news using NewsAPI, newspaper3k, spacy, Pegasus and T5 from Hugging Face. It categorizes news articles and uses a graph-based summary feature to summarize multiple documents. The app works with news in any language supported by NewsAPI.

    Language:Python9114
  • kailashkarthik9/News-Crawler

    Final Year Project. News Extraction and Summarization

    Language:Java1100
  • mrizqiaal/news-extractor

    News Extractor

    Language:Python1200
  • kailashkarthik9/News-Extraction

    Final Year Project. News Extraction and Summarization

    Language:C0100
  • rogelioolarte/NewsExtractor

    API designed to extract large amounts of articles from any URL or website supported the use of CSS selectors documented with Swagger (OpenAPI 3).

    Language:Java0102