2016-04-17 15:48:03 +00:00
<!DOCTYPE html>
< html dir = "ltr" >
< head >
< meta http-equiv = "content-type" content = "text/html;charset=UTF-8" / >
< title > App/Import/English Wikipedia - XOWA< / title >
< link rel = "shortcut icon" href = "https://gnosygnu.github.io/xowa/xowa_logo.png" / >
< link rel = "stylesheet" href = "https://gnosygnu.github.io/xowa/xowa_common.css" type = "text/css" >
< / head >
< body class = "mediawiki ltr sitedir-ltr ns-0 ns-subject skin-vector action-submit vector-animateLayout" spellcheck = "false" >
< div id = "mw-page-base" class = "noprint" > < / div >
< div id = "mw-head-base" class = "noprint" > < / div >
< div id = "content" class = "mw-body" >
< h1 id = "firstHeading" class = "firstHeading" > < span > App/Import/English Wikipedia< / span > < / h1 >
< div id = "bodyContent" class = "mw-body-content" >
< div id = "siteSub" > From XOWA: the free, open-source, offline wiki application< / div >
< div id = "contentSub" > < / div >
< div id = "mw-content-text" lang = "en" dir = "ltr" class = "mw-content-ltr" >
< div id = "toc" class = "toc" >
< div id = "toctitle" >
< h2 >
Contents
< / h2 >
< / div >
< ul >
< li class = "toclevel-1 tocsection-1" >
2016-10-06 06:45:30 +00:00
< a href = "#Background" > < span class = "tocnumber" > 1< / span > < span class = "toctext" > Background< / span > < / a >
2016-04-17 15:48:03 +00:00
< / li >
< li class = "toclevel-1 tocsection-2" >
2016-10-06 06:45:30 +00:00
< a href = "#Quick_start" > < span class = "tocnumber" > 2< / span > < span class = "toctext" > Quick start< / span > < / a >
2016-04-17 15:48:03 +00:00
< ul >
< li class = "toclevel-2 tocsection-3" >
2016-10-06 06:45:30 +00:00
< a href = "#Articles" > < span class = "tocnumber" > 2.1< / span > < span class = "toctext" > Articles< / span > < / a >
2016-04-17 15:48:03 +00:00
< ul >
< li class = "toclevel-3 tocsection-4" >
2016-10-06 06:45:30 +00:00
< a href = "#Download_pre-built_wikis_from_archive.org" > < span class = "tocnumber" > 2.1.1< / span > < span class = "toctext" > Download pre-built wikis from archive.org< / span > < / a >
2016-04-17 15:48:03 +00:00
< / li >
< li class = "toclevel-3 tocsection-5" >
2016-10-06 06:45:30 +00:00
< a href = "#Build_wikis_using_the_database_dumps_at_wikimedia.org" > < span class = "tocnumber" > 2.1.2< / span > < span class = "toctext" > Build wikis using the database dumps at wikimedia.org< / span > < / a >
2016-04-17 15:48:03 +00:00
< / li >
< / ul >
< / li >
< li class = "toclevel-2 tocsection-6" >
2016-10-06 06:45:30 +00:00
< a href = "#Images" > < span class = "tocnumber" > 2.2< / span > < span class = "toctext" > Images< / span > < / a >
2016-09-19 03:24:34 +00:00
< / li >
< / ul >
< / li >
2016-10-06 06:45:30 +00:00
< li class = "toclevel-1 tocsection-7" >
< a href = "#Detailed_start" > < span class = "tocnumber" > 3< / span > < span class = "toctext" > Detailed start< / span > < / a >
2016-04-17 15:48:03 +00:00
< / li >
< / ul >
< / div >
< h2 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Background" > Background< / span >
2016-04-17 15:48:03 +00:00
< / h2 >
< p >
2016-10-06 06:45:30 +00:00
English Wikipedia has a lot of data. There are over 16 million pages with 25 GB of text, as well as 5 million images.
2016-04-17 15:48:03 +00:00
< / p >
< p >
2016-09-19 03:24:34 +00:00
Setting all this up on your computer will not be a quick process. As a general estimate, you will need at least 30 GB and 5 hours processing time. If you want images as well, the numbers increase to 100 GB of disk space and 30+ hours of processing time. However, when you are done, you will have a complete, recent copy of English Wikipedia with images that can fit on a 128 GB SD card.
2016-04-17 15:48:03 +00:00
< / p >
< p >
Although the process itself is not hard, I < b > strongly recommend< / b > that you try Simple Wikipedia first. Simple Wikipedia has 180,000 pages and 90,000 images. The text version uses 200 MB and sets up in 5 minutes. With images, this expands to 2 GB and 30 minutes of downloading time. Simple Wikipedia is a reasonably accurate simulation of English Wikipedia -- just much smaller. It'll also give you a pretty good idea of what XOWA can do.
< / p >
< h2 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Quick_start" > Quick start< / span >
2016-04-17 15:48:03 +00:00
< / h2 >
< h3 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Articles" > Articles< / span >
2016-04-17 15:48:03 +00:00
< / h3 >
2016-10-06 06:45:30 +00:00
< p >
There are two approaches:
< / p >
2016-04-17 15:48:03 +00:00
< ul >
< li >
2016-10-06 06:45:30 +00:00
Download pre-built wikis from archive.org
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Build wikis using the database dumps at wikimedia.org
2016-04-17 15:48:03 +00:00
< / li >
< / ul >
< h4 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Download_pre-built_wikis_from_archive.org" > Download pre-built wikis from archive.org< / span >
2016-04-17 15:48:03 +00:00
< / h4 >
< ul >
< li >
2016-10-06 06:45:30 +00:00
Go to < code > Special:XowaDownloadCentral< / code > from the URL bar within XOWA. You can also do Main Menu -> Tools -> Download Central
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
In the < b > Available< / b > table, add the task names that start off with < code > English Wikipedia - Articles< / code >
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
In the < b > Queue< / b > table, click the Play button
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Wait for the Queue to process. When it is done, enter the wiki URL in the bar. For example, < code > en.wikipedia.org< / code >
2016-04-17 15:48:03 +00:00
< / li >
< / ul >
< h4 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Build_wikis_using_the_database_dumps_at_wikimedia.org" > Build wikis using the database dumps at wikimedia.org< / span >
2016-04-17 15:48:03 +00:00
< / h4 >
2016-09-19 03:24:34 +00:00
< ul >
< li >
2017-02-12 20:32:44 +00:00
Go to < a href = "http://xowa.org/home/wiki/Dashboard/Import/Online.html" id = "xolnki_2" title = "Dashboard/Import/Online" class = "xowa-visited" > Dashboard/Import/Online< / a >
2016-09-19 03:24:34 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Find < code > en.wikipedia.org< / code > in the list
2016-09-19 03:24:34 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Click the < code > download< / code > link.
2016-09-19 03:24:34 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Wait for the wiki to build. When it is done, it will automatically load the Main Page
2016-09-19 03:24:34 +00:00
< / li >
< / ul >
< h3 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Images" > Images< / span >
2016-09-19 03:24:34 +00:00
< / h3 >
< ul >
< li >
2016-10-06 06:45:30 +00:00
Go to < code > Special:XowaDownloadCentral< / code > from the URL bar within XOWA. You can also do Main Menu -> Tools -> Download Central
2016-09-19 03:24:34 +00:00
< / li >
2016-04-17 15:48:03 +00:00
< li >
2016-10-06 06:45:30 +00:00
In the < b > Available< / b > table, add the task names that start off with < code > English Wikipedia - Images< / code > and < code > English Wikipedia - Image Update< / code >
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
In the < b > Queue< / b > table, click the Play button
2016-04-17 15:48:03 +00:00
< / li >
< li >
2016-10-06 06:45:30 +00:00
Wait for the Queue to process. When it is done, enter any page in the bar. For example, < code > en.wikipedia.org/wiki/Earth< / code >
2016-04-17 15:48:03 +00:00
< / li >
< / ul >
2016-09-19 03:24:34 +00:00
< p >
< br >
< / p >
2016-04-17 15:48:03 +00:00
< h2 >
2016-10-06 06:45:30 +00:00
< span class = "mw-headline" id = "Detailed_start" > Detailed start< / span >
2016-04-17 15:48:03 +00:00
< / h2 >
< p >
2017-02-02 16:48:15 +00:00
See < a href = "http://xowa.org/home/wiki/Wiki_setup/English_wikis.html" id = "xolnki_3" title = "Wiki setup/English wikis" class = "xowa-visited" > Wiki_setup/English_wikis< / a >
2016-04-17 15:48:03 +00:00
< / p >
2016-09-12 01:53:06 +00:00
2016-04-17 15:48:03 +00:00
< / div >
< / div >
< / div >
< div id = "mw-head" class = "noprint" >
< div id = "left-navigation" >
< div id = "p-namespaces" class = "vectorTabs" >
< h3 > Namespaces< / h3 >
< ul >
< li id = "ca-nstab-main" class = "selected" > < span > < a id = "ca-nstab-main-href" href = "index.html" > Page< / a > < / span > < / li >
< / ul >
< / div >
< / div >
< / div >
< div id = 'mw-panel' class = 'noprint' >
< div id = 'p-logo' >
< a style = "background-image: url(https://gnosygnu.github.io/xowa/xowa_logo.png);" href = "http://xowa.org/" title = "Visit the main page" > < / a >
< / div >
< div class = "portal" id = 'xowa-portal-home' >
< h3 > XOWA< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://xowa.org/index.html" title = 'Visit the main page' > Main page< / a > < / li >
< li > < a href = "http://xowa.org/screenshots.html" title = 'See screenshots of XOWA' > Screenshots< / a > < / li >
2016-06-26 06:10:12 +00:00
< li > < a href = "https://www.youtube.com/watch?v=q0qbXYXEH6M" title = "See a video of XOWA Desktop in action" > Video< / a > < / li >
2016-04-17 15:48:03 +00:00
< li > < a href = "http://xowa.org/home/wiki/Help/Download_XOWA.html" title = 'Download the XOWA application' > Download XOWA< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/Dashboard/Image_databases.html" title = 'Download offline wikis and image databases' > Download wikis< / a > < / li >
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-started' >
< h3 > Getting started< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://xowa.org/home/wiki/App/Setup/System_requirements.html" title = 'Get XOWA's system requirements' > Requirements< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/App/Setup/Installation.html" title = 'Get instructions for installing XOWA' > Installation< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/App/Import/Simple_Wikipedia.html" title = 'Learn how to set up Simple Wikipedia' > Simple Wikipedia< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/App/Import/English_Wikipedia.html" title = 'Learn how to set up English Wikipedia' > English Wikipedia< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/App/Import/Other_wikis.html" title = 'Learn how to set up other Wikipedias' > Other Wikipedias< / a > < / li >
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-android' >
< h3 > Android< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://xowa.org/home/wiki/Android/Setup.html" title = 'Setup XOWA on your Android device' > Setup< / a > < / li >
2016-06-26 06:10:12 +00:00
< li > < a href = "https://www.youtube.com/watch?v=jsMTBxGweUw" title = "See a video of XOWA Android in action" > Video< / a > < / li >
2016-04-17 15:48:03 +00:00
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-help' >
< h3 > Help< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://xowa.org/home/wiki/Help/About.html" title = 'Get more information about XOWA' > About< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/Help/Contents.html" title = 'View a list of help topics' > Contents< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/Help/Media.html" title = 'Read what others have written about XOWA' > Media< / a > < / li >
< li > < a href = "http://xowa.org/home/wiki/Help/Feedback.html" title = 'Questions? Comments? Leave feedback for XOWA' > Feedback< / a > < / li >
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-blog' >
< h3 > Blog< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://xowa.org/home/wiki/Blog.html" title = 'Follow XOWA' ' s development process ' > Current< / a > < / li >
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-links' >
< h3 > Links< / h3 >
< div class = "body" >
< ul >
< li > < a href = "http://dumps.wikimedia.org/backup-index.html" title = "Get wiki datababase dumps directly from Wikimedia" > Wikimedia dumps< / a > < / li >
< li > < a href = "https://archive.org/search.php?query=xowa" title = "Search archive.org for XOWA files" > XOWA @ archive.org< / a > < / li >
< li > < a href = "http://en.wikipedia.org" title = "Visit Wikipedia (and compare to XOWA!)" > English Wikipedia< / a > < / li >
< / ul >
< / div >
< / div >
< div class = "portal" id = 'xowa-portal-donate' >
< h3 > Donate< / h3 >
< div class = "body" >
< ul >
< li > < a href = "https://archive.org/donate/index.php" title = "Support archive.org!" > archive.org< / a > < / li > <!-- listed first due to recent fire damages: http://blog.archive.org/2013/11/06/scanning - center - fire - please - help - rebuild/ -->
< li > < a href = "https://donate.wikimedia.org/wiki/Special:FundraiserRedirector" title = "Support Wikipedia!" > Wikipedia< / a > < / li >
2017-02-12 20:32:44 +00:00
< li >
< form action = "https://www.paypal.com/cgi-bin/webscr" method = "post" target = "_top" >
< input type = "hidden" name = "cmd" value = "_s-xclick" >
< input type = "hidden" name = "hosted_button_id" value = "PQ529NWHCGDQ2" >
< div style = "display:flex" >
< span style = "margin:2px 4px 0px 0px" > XOWA< / span >
< input type = "image" height = 14 src = "https://www.paypalobjects.com/en_US/i/btn/btn_donate_SM.gif" border = "0" name = "submit" alt = "PayPal - The safer, easier way to pay online!" >
<!-- <img alt="" border="0" src="https://www.paypalobjects.com/en_US/i/scr/pixel.gif" width="1" height="1"> -->
< / div >
< / form >
< / li >
2016-04-17 15:48:03 +00:00
< / ul >
< / div >
< / div >
2016-04-17 18:00:49 +00:00
2016-04-17 15:48:03 +00:00
< / div >
< / body >
< / html >