Purpose
Fix Up RSS (and atom) fixes rss feeds without full text.
It works by taking a feed URL and one or more XPATH expressions to extract the full text. For each article in the feed, it fetches the pointed-to article, does the XPATH extractions, and packages the result as the new feed.
It also does some minor things that are intended to unbreak the web: It resolves feed links to the article they ultimately point at (so that e.g., when google shuts down feedproxy.google.com, URLs you copied & pasted from feeds will still work) and it removes "utm_ trackers" that it recognizes from URLs in order to protect your privacy.
Development status
The author (@jepler) is actively using the project and it meets his needs. The source is shared in the spirit of open source, but Issues and pull requests are not likely to be acted on. I would be interested in passing this project to an active maintainer who wants to make it into a community project.
Tested with
- Python 3.11
- feedparser 6.0.10
- BeautifulSoup4 4.11.2
Optional:
- memcached (the default debian/ubuntu packaging sets up memcached compatibly)
Configuration
furss is configured via an rc file (really, a Python script), either
~/.furssrc
or the file named on the commandline. The defaults are shown near
the top of the furss.py
script. furssrc.sample
shows a sample
configuration.
License
The following license is granted by the authors for all code in this repository:
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
Individual files may also offer more liberal licenses.