mirror of
https://github.com/gnosygnu/xowa.git
synced 2026-03-02 03:49:30 +00:00
c
This commit is contained in:
@@ -103,13 +103,13 @@
|
||||
XOWA reads every page title and breaks it up into words
|
||||
</li>
|
||||
<li>
|
||||
XOWA stores this data in an <a href="/site/en.wikipedia.org/wiki/Inverted_index">inverted index</a>. From a database standpoint, they are placed in two database tables called search_word and search_link.
|
||||
XOWA stores this data in an <a href="https://en.wikipedia.org/wiki/Inverted_index" rel="nofollow" class="external text">inverted index</a>. From a database standpoint, they are placed in two database tables called search_word and search_link.
|
||||
</li>
|
||||
<li>
|
||||
XOWA then downloads a list of pagelinks from Wikimedia's dump servers. For example, for 2016-03 English Wikipeda, the link is <a href="http://dumps.wikimedia.org/enwiki/20160305/enwiki-20160305-pagelinks.sql.gz" rel="nofollow" class="external free">http://dumps.wikimedia.org/enwiki/20160305/enwiki-20160305-pagelinks.sql.gz</a>
|
||||
</li>
|
||||
<li>
|
||||
XOWA then parses this data and calculates <a href="/site/en.wikipedia.org/wiki/PageRank">PageRank</a> based on which page links to which page
|
||||
XOWA then parses this data and calculates <a href="https://en.wikipedia.org/wiki/PageRank" rel="nofollow" class="external text">PageRank</a> based on which page links to which page
|
||||
</li>
|
||||
<li>
|
||||
XOWA then applies a series of calculations to come up with a page score for each page. For more info, see <a href="/wiki/Help/Features/Search/Score" id="xolnki_3" title="Help/Features/Search/Score">Help/Features/Search/Score</a>
|
||||
|
||||
Reference in New Issue
Block a user