Offline Wikipedia reader

From Openmoko

(Difference between revisions)
(Development status)
(wiki)
 
(3 intermediate revisions by one user not shown)
Line 1: Line 1:
Instructions can be found here:
+
{{application|offline wikipedia}}.
http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html
+
  
Initial indexing should be carried out on a desktop/laptop, as the freerunner is not powerful enough
+
{{ApplicationBox|
 +
Name=[[Offline Wikipedia reader]]|
 +
Description=Read entirety of Wikipedia offline|
 +
Screenshot=offline_wikipedia.png|
 +
Homepage=http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html|
 +
TestedOn=-|
 +
PackageName=no package yet
 +
}}
  
{{application|offline wikipedia}}
+
The Offline Wikipedia reader is a set of scripts and programmes which can be used to display wikipedia pages without an internet connection.
  
This project provides software to store the entirety of Wikipedia (any language) locally on a Linux device.
+
The English Wikipedia (text only) is around 6GB including indices, so the entire content can be stored on one 8GB card. The German Wikipedia is approximately 1/4 the size, so can be stored on a correspondingly smaller card; ditto for other languages. As the files are so large, you may want to remove the microSD card from the Freerunner and use a card reader to transfer the files; copying them over USB will take several hours.
  
All Wikipedia pages are downloaded from the Wikipedia page dump - [http://download.wikimedia.org/enwiki/20081008/]. The files needed is called 'pages-articles.xml.bz2', and the most recent is 4.1GB
+
The software provides a custom lightweight webserver running locally and uses php to present the pages, which are then viewed using any web browser.
  
For more info visit the official website: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html
+
Instructions can be found here:
 +
http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html
  
The English Wikipedia (text only) is around 6GB, so the entire content can be stored on one 8GB device. The German Wikipedia is approximately 1/4 the size, so can be stored on a correspondingly smaller card; ditto for other languages
+
All Wikipedia pages are contained in the Wikipedia page dump - [http://download.wikimedia.org/enwiki/20081008/]. The file needed is called 'pages-articles.xml.bz2', and the most recent is 4.1GB
 +
 
 +
Before the pages can be displayed, an indexer needs to be run - this should be carried out on a desktop/laptop, as it would take many hours on the Freerunner; this takes approximately one hour on a dual-core 1.1GHz cpu, 1.5GB RAM laptop.
 +
 
 +
Dependencies (php, perl, python) are then installed on the freerunner, and the index and database copied across, along with the included web server.
  
The software functions by running a lightweight webserver on the phone, and using php to present the pages, which are then viewed using any web browser.
 
  
 
=Development status=
 
=Development status=
 
+
At present, a single tar.bz is downloaded from the site above, the pages extract downloaded and copied to the correct location, and the indexing process run.  
At present, a single tar.bz is downloaded from the site above, the pages extract downloaded and copied to the correct location, and the indexing process run (takes approximately one hour on a dual-core 1.1GHz cpu, 1.5GB RAM laptop). Dependencies (php, perl ,python) are then installed on the freerunner, and the index and database copied to the freerunner, along with the web server.
+
  
 
In the future the software will be released as an ipk. Also a deb/rpm for a desktop distributions, to allow automatic download of the most recent wikipedia dump, with a diff utility to allow it to be updated and re-indexed.
 
In the future the software will be released as an ipk. Also a deb/rpm for a desktop distributions, to allow automatic download of the most recent wikipedia dump, with a diff utility to allow it to be updated and re-indexed.
 
 
<onlyinclude>
 
{{ApplicationBox|
 
Name=[[Offline Wikipedia reader]]|
 
Description=Read entirety of Wikipedia offline|
 
Screenshot=offline_wikipedia.png|
 
Homepage=http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html|
 
TestedOn=-|
 
PackageName=no package yet
 
}}
 
</onlyinclude>
 
  
 
[[Category:Applications]]
 
[[Category:Applications]]

Latest revision as of 17:32, 19 July 2009

Offline Wikipedia is one of the applications that run on Openmoko phones. For a list of all applications, visit Applications.

Offline Wikipedia reader

Read entirety of Wikipedia offline


Homepage: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html
Package: no package yet
Tested on: -


The Offline Wikipedia reader is a set of scripts and programmes that display Wikipedia pages without an internet connection.

The English Wikipedia (text only) is around 6GB including indices, so the entire content can be stored on one 8GB card. The German Wikipedia is approximately 1/4 the size, so it can be stored on a correspondingly smaller card; the same applies to other languages. As the files are so large, you may want to remove the microSD card from the Freerunner and use a card reader to transfer them; copying them over USB will take several hours.
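The storage and transfer figures above can be checked with a little arithmetic. The following is a minimal sketch, not part of the project: the function names, the 0.5GB safety reserve and the 0.5 MB/s USB throughput are illustrative assumptions, so substitute values measured on your own device.

```python
def fits_on_card(content_gb: float, card_gb: float, reserve_gb: float = 0.5) -> bool:
    """Return True if the content plus a safety reserve fits on the card.

    reserve_gb is an assumed allowance for filesystem overhead and for the
    gap between a card's nominal and usable capacity.
    """
    return content_gb + reserve_gb <= card_gb


def transfer_hours(content_gb: float, throughput_mb_per_s: float) -> float:
    """Estimate the copy time in hours at a sustained throughput (1 GB = 1000 MB)."""
    return content_gb * 1000 / throughput_mb_per_s / 3600


# English Wikipedia: about 6GB including indices, on an 8GB card.
english_fits = fits_on_card(6.0, 8.0)

# German Wikipedia: roughly a quarter of that size, on a 2GB card.
german_fits = fits_on_card(6.0 / 4, 2.0)

# Copy time over USB at an assumed 0.5 MB/s sustained rate.
usb_hours = transfer_hours(6.0, 0.5)
```

At the assumed 0.5 MB/s the 6GB copy works out to a little over three hours, which matches the "several hours" estimate; a card reader with a higher sustained rate shortens this proportionally.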

The software provides a custom lightweight web server that runs locally and uses PHP to present the pages, which are then viewed in any web browser.
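To illustrate the idea of a local web server that turns a requested title into an HTML page, here is a minimal sketch in Python. It is a stand-in for illustration only: the project ships its own server and PHP scripts, and the names `ARTICLES`, `render_page`, `ArticleHandler` and `serve`, as well as port 8000, are hypothetical.

```python
import html
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import unquote

# Hypothetical in-memory article store; a real reader would look the text
# up in the indexed dump instead.
ARTICLES = {"Openmoko": "Placeholder article text."}


def render_page(title, articles):
    """Return (HTTP status, HTML) for the requested article title."""
    body = articles.get(title)
    if body is None:
        return 404, f"<h1>Not found: {html.escape(title)}</h1>"
    return 200, f"<h1>{html.escape(title)}</h1><pre>{html.escape(body)}</pre>"


class ArticleHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Map a path such as /Openmoko to the article title "Openmoko".
        title = unquote(self.path.lstrip("/"))
        status, page = render_page(title, ARTICLES)
        payload = page.encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)


def serve(port=8000):
    """Serve articles on the loopback interface until interrupted."""
    HTTPServer(("127.0.0.1", port), ArticleHandler).serve_forever()
```

Calling `serve()` and opening http://127.0.0.1:8000/Openmoko in a browser on the same device would show the placeholder article. Binding to 127.0.0.1 keeps the server reachable only from the device itself.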

Instructions can be found here: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html

All Wikipedia pages are contained in the Wikipedia page dump (http://download.wikimedia.org/enwiki/20081008/). The file needed is called 'pages-articles.xml.bz2', and the most recent one is 4.1GB.
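Because the dump is a bzip2-compressed XML file, it can be inspected as a stream without unpacking it to disk first. The sketch below lists article titles this way. It assumes that each `<title>` element sits on a single line, as it does in the MediaWiki export format; the function name `iter_titles` is hypothetical and is not part of the project's scripts.

```python
import bz2
import html
import re
from typing import Iterator

# Matches a title element that sits on one line of the dump.
TITLE_RE = re.compile(r"<title>(.*?)</title>")


def iter_titles(dump_path: str) -> Iterator[str]:
    """Yield article titles from a pages-articles.xml.bz2 style dump.

    The file is decompressed line by line, so memory use stays small even
    for a multi-gigabyte dump.
    """
    with bz2.open(dump_path, "rt", encoding="utf-8") as dump:
        for line in dump:
            match = TITLE_RE.search(line)
            if match:
                # Titles are XML-escaped in the dump (for example &amp;).
                yield html.unescape(match.group(1))
```

For example, `sum(1 for _ in iter_titles(path))` counts the pages in a dump, where `path` points at the downloaded file.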

Before the pages can be displayed, an indexer needs to be run. This should be done on a desktop or laptop, where it takes approximately one hour (measured on a laptop with a dual-core 1.1GHz CPU and 1.5GB RAM); on the Freerunner it would take many hours.
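To give a feel for what an indexing pass involves, the sketch below splits a dump into small, independently compressed chunks and records which chunk holds each title, so that a single article can later be read without decompressing the whole file. This is an illustrative technique under stated assumptions, not a description of the project's actual indexer: the function name `build_chunked_index`, the chunk file naming and the `pages_per_chunk` parameter are all hypothetical.

```python
import bz2
import re
from pathlib import Path

TITLE_RE = re.compile(r"<title>(.*?)</title>")


def build_chunked_index(dump_path, out_dir, pages_per_chunk=1000):
    """Split the dump into compressed chunks and return {title: chunk file name}."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    index = {}
    buffer, titles = [], []
    chunk_no = 0
    pages_in_chunk = 0

    def flush():
        # Write the buffered pages as one chunk and index their titles.
        nonlocal buffer, titles, chunk_no, pages_in_chunk
        if not buffer:
            return
        name = f"chunk-{chunk_no:06d}.xml.bz2"
        (out / name).write_bytes(bz2.compress("".join(buffer).encode("utf-8")))
        for title in titles:
            index[title] = name
        buffer, titles = [], []
        chunk_no += 1
        pages_in_chunk = 0

    in_page = False
    with bz2.open(dump_path, "rt", encoding="utf-8") as dump:
        for line in dump:
            if "<page>" in line:
                in_page = True
            if in_page:
                buffer.append(line)
                match = TITLE_RE.search(line)
                if match:
                    titles.append(match.group(1))
            if "</page>" in line:
                in_page = False
                pages_in_chunk += 1
                if pages_in_chunk >= pages_per_chunk:
                    flush()
    flush()  # write any remaining pages
    return index
```

Looking up an article then means opening only the chunk named in the index, for example with `bz2.open(Path(out_dir) / index[title], "rt", encoding="utf-8")`. The single pass over the whole dump is the expensive part, which is consistent with running it on a desktop rather than on the phone.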

The dependencies (PHP, Perl, Python) are then installed on the Freerunner, and the index and database are copied across, along with the included web server.


Development status

At present, installation is manual: a single tar.bz archive is downloaded from the site above, the pages dump is downloaded and copied to the correct location, and the indexing process is run.

In the future the software will be released as an ipk. A deb/rpm for desktop distributions is also planned, to allow automatic download of the most recent Wikipedia dump, with a diff utility to allow it to be updated and re-indexed.


Instructions can be found here: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html

Initial indexing should be carried out on a desktop/laptop, as the Freerunner is not powerful enough.

Offline Wikipedia is one of the applications that run on Openmoko phones. For a list of all applications, visit Applications.

This project provides software to store the entirety of Wikipedia (any language) locally on a Linux device.

All Wikipedia pages are downloaded from the Wikipedia page dump (http://download.wikimedia.org/enwiki/20081008/). The file needed is called 'pages-articles.xml.bz2', and the most recent one is 4.1GB.

For more info visit the official website: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html

The English Wikipedia (text only) is around 6GB, so the entire content can be stored on one 8GB device. The German Wikipedia is approximately 1/4 the size, so it can be stored on a correspondingly smaller card; the same applies to other languages.

The software works by running a lightweight web server on the phone and using PHP to present the pages, which are then viewed in any web browser.

Development status

At present, a single tar.bz archive is downloaded from the site above, the pages dump is downloaded and copied to the correct location, and the indexing process is run (this takes approximately one hour on a laptop with a dual-core 1.1GHz CPU and 1.5GB RAM). The dependencies (PHP, Perl, Python) are then installed on the Freerunner, and the index and database are copied to it, along with the web server.

In the future the software will be released as an ipk. A deb/rpm for desktop distributions is also planned, to allow automatic download of the most recent Wikipedia dump, with a diff utility to allow it to be updated and re-indexed.


Offline Wikipedia reader

Read entirety of Wikipedia offline


Homepage: http://users.softlab.ece.ntua.gr/~ttsiod/buildWikipediaOffline.html
Package: no package yet
Tested on: -