At the moment Outernet are automatically grabbing pages from the 5000 most popular
@syed I’d suggest it would be more topical to just get the top page from here every day (Excluding some porn, 404 pages & a few TV station pages that don’t seem that interesting) What do you think?
It should be as easy as wget on the top link from that page, then once a file has been broadcast you could stick it in a blacklist to not be re-broadcast for X months…
(Just picking some random dates; You’d have got David Bowie’s page the day after his death Brexit a couple of days after it happened Pokemon Go a couple of days after release Michael Phelps at the Olympics The Eurovision song contest result)
In short I think it would be a good way to get some in-depth info on stuff that’s happening, along with a mix of random stuff, like the Anterior interventricular branch of left coronary artery & The Black Robin & The Clathrate gun hypothesis
Just noticed you can do the same for other languages like French, Spanish Arabic Chinese, Turkish, Russian so this would be a really nice way to diversify content, and provide topical information to people where Wikipedia and other news sources are blocked.
Say it’s 500kb for a compressed Wikipedia page (220 after compresstion) on average you’d be looking at 1.5mb a day to do 7 languages. I think it would do a lot for the perception of the project to do so though.