The archive was noticed by a person on Reddit who shared it in a post. The person included a link to the website for Kiwix, where the 89 GB ZIM file (a format for offline wiki content) is available for download.
Wikipedia routinely makes a dump of its databases available publicly, which Kiwix then compresses into an archive so it can be more easily shared. For its latest archive, Kiwix used Wikipedia’s database dump made on June 23. Stephane Coillet-Matillon is the co-founder of Kiwix and told Motherboard the most recent archive has been available on the Kiwix website since early July.
“Essentially, we’re trying to make a copy of the whole internet for offline use,” Coillet-Matillon said. “So we get stock exchange, we get Ted Talks. Anyone can come up with a request and if we can make a copy and it’s legal, it’s fine, and we distribute it.”
The Wikimedia Foundation, the nonprofit that hosts Wikipedia, has funded part of Kiwix’s work, Coillet-Matillon said.
Kiwix makes and distributes archives of Wikipedias in all languages, but the English version is by far the largest. A new archive of the entire English Wikipedia by Kiwix hasn’t been available since October 2018.
“There have been many dumps released since October '18, but we failed every time,” Coillet-Matillon said. “Back then the whole process took 3 weeks, and as per Murphy's law would crash on day 20, sending us back to square one.”
Coillet-Matillon said the main reason for providing the archives is so that people without access to the internet can still have access to Wikipedia.
“There’s 4 billion people without internet access in the world,” Coillet-Matillon said. “It took 30 years for the richest half of the world to be connected to the web. I don’t think it’s going be any easier or faster to connect the poorest half.”
Get a personalized roundup of VICE's best stories in your inbox.
By signing up to the VICE newsletter you agree to receive electronic communications from VICE that may sometimes include advertisements or sponsored content.