results from Nitter feeds don't include the tweet author
kylrth opened this issue · 7 comments
With the following pipe definition:
[sources.lexfridman]
url = "https://nitter.nl/lexfridman/search/rss?f=tweets&e-replies=on"
type = "feed"
filters = ["unseen", "links", "includelink", "digest"]
I get an email that looks like this:
As you can see, the tweet body appears as both the title and the content. I'd ideally like the title to be the tweet author name (which may be different from @lexfridman in the case of retweets or if I'm making a digest of multiple Twitter users), and then let the body be the same.
I know you've got that replace
filter for simple text manipulation in the body, but have you got any ideas about more complex field manipulation that might make it possible to insert the <dc:creator>
tag into the title or something like that?
I see, let me check how I could do something along those lines.
Thanks.
I've looked at the RSS feed given by nitter and it's not that trivial. We could create a special 'nitter' filter (as I don't believe Twitter give native RSS) feed, but that seems a bit contrived.
The real issue here would be to be able to put the dc:creator
field inside the title field. First I have to see if it's exposed by the RSS library, maybe it's specific to some feeds and not always present. Secondly, it would be weird to have a filter that does this but does not allow to do it for description etc.
The "good" solution IMHO would be to have the long term script solution, where I would provide a way to write simple manipulations of the feed, where it should be as simple as:
item.Title = item.Author
but we are not there yet. I think I will do a "authorastitle" (or better name) filter and remove it when we have a better solution.
Hello, I didn't forget about this feature but still haven't found a satisfying way of implementing it.
Yeah no worries, I wouldn't want you to implement a hacky solution specific to my particular request.
@slurdge suggestion: How about a field mapper, where we can define in the toml which fields from the rss will be mapped to which goeland fields ?
That's a good idea, however the fact is not all feed (let's say imgur source for example) would have all fields. I guess this is also something that would be better tailored once I have a small scripting engine, as this would be trivial to do so with it.
Still I keep the suggestion, maybe a filter with a simple syntax would fit the bill