ageitgey/node-unfluff
Automatically extract body content (and other cool stuff) from an html document
HTMLApache-2.0
Issues
- 4
Any up to date alternatives?
#108 opened by Aditya94A - 0
Extraction fails on plain text files
#114 opened by bogdanionitabp - 13
Convert to front-end friendly, remove 'fs'
#62 opened by knod - 0
Bad regex causing very slow execution
#112 opened by kduffie - 0
Can i use with utf 8 ?
#111 opened by pedrosarkis - 1
Parse Page Schema.org Data
#103 opened by ISNIT0 - 0
Author is not accurate
#110 opened by chan-dev - 3
Extract not all text
#31 opened by yanosh-igor - 2
Problem with New York Times stories
#76 opened by gautamh - 0
Title element is not correct
#105 opened by AndrejGajdos - 0
303 See Other
#102 opened by angeloh - 0
vietnamese stop words
#101 opened by cfreifeld - 2
- 0
"Some features may not work without JavaScript. Please try enabling it if you encounter problems."
#100 opened by KrishieldKyle - 3
How to get HTML content of the text?
#87 opened by malcommac - 1
Bad lazy author extraction
#94 opened by 8enmann - 0
Extracted Date is Wrong
#93 opened by AgoloAhmedElhady - 5
loadash needs update
#91 opened by cekvenich - 1
- 2
Links and images
#90 opened by gbelvedere - 0
- 2
400 Bad Request
#79 opened by AndrejGajdos - 0
unfluff in ionic app
#83 opened by kudchikarsk - 1
- 2
Deprecated modules
#58 opened by riyaznet - 0
can't got iframe video from html
#78 opened by ostapetc - 3
Date isn't always ISO format?
#77 opened by mooniker - 3
Cannot use client-side with React Native
#74 opened by joncursi - 0
TypeError: this.lang is not a function
#73 opened by bitcoinvsalts - 0
Grabbing sidebar content
#70 opened by adamrabie - 0
- 0
- 1
How can manage this case ?
#56 opened by christophebe - 1
What coffee does unfluff drink?
#54 opened by bennyk - 0
- 2
- 1
Extract text with line breaks
#55 opened by adrianparr - 2
- 4
do you support open graph
#52 opened by yawhide - 3
Incorrect video extractions
#50 opened by snellingio - 2
Extract author
#48 opened by PetrKaleta - 1
- 1
- 5
Text missing
#38 opened by akreienbring - 1
Twitter status (tweet) as article?
#29 opened by mattpal - 2
- 1
Typo in extractor#isHighlinkDensity ?
#32 opened by dminkovsky - 1
Trim whitespace from tags?
#34 opened by pdehaan - 1
- 2
Ignore Social Buttons
#27 opened by timcosta