mirror of
https://github.com/gnosygnu/xowa.git
synced 2026-03-02 03:49:30 +00:00
blog_new_job[D[D[D[D[D[D[D_
This commit is contained in:
@@ -59,7 +59,7 @@
|
||||
</ul>
|
||||
</div>
|
||||
<h2>
|
||||
<span class="mw-headline" id="Overview">Overview</span>
|
||||
<span class="mw-headline" id="Overview">Overview</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Overview" title="Edit section: Overview" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h2>
|
||||
<p>
|
||||
As of v3.3.4.1, XOWA has a new search engine. This uses the same search functionality that is in XOWA Android.
|
||||
@@ -93,7 +93,7 @@
|
||||
<br>
|
||||
</p>
|
||||
<h2>
|
||||
<span class="mw-headline" id="Process">Process</span>
|
||||
<span class="mw-headline" id="Process">Process</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Process" title="Edit section: Process" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h2>
|
||||
<p>
|
||||
XOWA will generate a search index during the wiki import process. The following steps occur:
|
||||
@@ -119,7 +119,7 @@
|
||||
<br>
|
||||
</p>
|
||||
<h2>
|
||||
<span class="mw-headline" id="Build_process">Build process</span>
|
||||
<span class="mw-headline" id="Build_process">Build process</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Build_process" title="Edit section: Build_process" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h2>
|
||||
<p>
|
||||
Due to the nature of the PageRank algorithm, a lot of additional time and disk-space is needed. These requirements are especially dramatic for English Wikipedia:
|
||||
@@ -156,13 +156,13 @@
|
||||
With this in mind, XOWA offers the following options:
|
||||
</p>
|
||||
<h3>
|
||||
<span class="mw-headline" id="Download_the_XOWA_search_databases_from_archive.org">Download the XOWA search databases from archive.org</span>
|
||||
<span class="mw-headline" id="Download_the_XOWA_search_databases_from_archive.org">Download the XOWA search databases from archive.org</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Download_the_XOWA_search_databases_from_archive.org" title="Edit section: Download_the_XOWA_search_databases_from_archive.org" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h3>
|
||||
<p>
|
||||
Monthly versions of English Wikipedia's search databases will be posted to <a href="https://archive.org/edit/Xowa_enwiki_latest" rel="nofollow" class="external free">https://archive.org/edit/Xowa_enwiki_latest</a> . You can just download a 2 GB dump of these databases and replace your copies.
|
||||
</p>
|
||||
<h3>
|
||||
<span class="mw-headline" id="Use_page-length_instead_of_PageRank">Use page-length instead of PageRank</span>
|
||||
<span class="mw-headline" id="Use_page-length_instead_of_PageRank">Use page-length instead of PageRank</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Use_page-length_instead_of_PageRank" title="Edit section: Use_page-length_instead_of_PageRank" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h3>
|
||||
<p>
|
||||
XOWA can use page-length and skip the pagelinks download (125+ GB) as well as the PageRank running time (8+ hours). However the generated results will not be as accurate as PageRank. Specifically, long pages like "List of ...." will have a high page score.
|
||||
@@ -188,13 +188,13 @@
|
||||
Note that 0 is the default value for this option.
|
||||
</p>
|
||||
<h3>
|
||||
<span class="mw-headline" id="Use_PageRank_but_limit_to_1_iteration">Use PageRank but limit to 1 iteration</span>
|
||||
<span class="mw-headline" id="Use_PageRank_but_limit_to_1_iteration">Use PageRank but limit to 1 iteration</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Use_PageRank_but_limit_to_1_iteration" title="Edit section: Use_PageRank_but_limit_to_1_iteration" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h3>
|
||||
<p>
|
||||
This option will still require a lot of disk space, but will limit the running time to a few hours. To use this option, do the same as above, but change "PageRank iteration count" to 1.
|
||||
</p>
|
||||
<h3>
|
||||
<span class="mw-headline" id="Use_PageRank_but_limit_to_1000_iteration">Use PageRank but limit to 1000 iteration</span>
|
||||
<span class="mw-headline" id="Use_PageRank_but_limit_to_1000_iteration">Use PageRank but limit to 1000 iteration</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Use_PageRank_but_limit_to_1000_iteration" title="Edit section: Use_PageRank_but_limit_to_1000_iteration" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h3>
|
||||
<p>
|
||||
This option will create the full version of PageRank search indexes. To use this option, do the same as above, but change "PageRank iteration count" to 1000.
|
||||
@@ -203,7 +203,7 @@
|
||||
<br>
|
||||
</p>
|
||||
<h2>
|
||||
<span class="mw-headline" id="Other_notes">Other notes</span>
|
||||
<span class="mw-headline" id="Other_notes">Other notes</span> <span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/wiki/Help/Features/Search/Build?action=edit&section_key=Other_notes" title="Edit section: Other_notes" class="xowa-hover-off">edit</a><span class="mw-editsection-bracket">]</span></span>
|
||||
</h2>
|
||||
<ul>
|
||||
<li>
|
||||
|
||||
Reference in New Issue
Block a user