1
0
mirror of https://github.com/gnosygnu/xowa.git synced 2026-03-02 03:49:30 +00:00

fix_cli_for_fsdb_make_v1a

This commit is contained in:
gnosygnu
2020-09-07 22:43:20 -04:00
parent 52523ced53
commit 2fa92d6e34
161 changed files with 2406 additions and 5103 deletions

View File

@@ -114,7 +114,7 @@
</h3>
<ul>
<li>
<b>Searches sort by page score</b>: By convention, XOWA scores every page with percentile ranking ("graded curve") from 0 to 1,000,000. Page score is based on <a href="/site/en.wikipedia.org/wiki/PageRank">https://en.wikipedia.org/wiki/PageRank</a> . In short, a page has a high page score if there are many pages linking to it. For more info, see <a href="/wiki/App/Search/Score" id="xolnki_5" title="App/Search/Score">App/Search/Score</a>
<b>Searches sort by page score</b>: By convention, XOWA scores every page with percentile ranking ("graded curve") from 0 to 1,000,000. Page score is based on <a href="https://en.wikipedia.org/wiki/PageRank" rel="nofollow" class="external free">https://en.wikipedia.org/wiki/PageRank</a> . In short, a page has a high page score if there are many pages linking to it. For more info, see <a href="/wiki/App/Search/Score" id="xolnki_5" title="App/Search/Score">App/Search/Score</a>
</li>
<li>
<b>Searches find a page that matches the text and list it as the first result</b>: For example, searching for "earth" will automatically find the page "Earth" and list it as the first result, regardless of its score. (Technically, this entry is assigned a temporary score of 3,000,000)

View File

@@ -65,10 +65,10 @@
</p>
<ul>
<li>
A <a href="/site/en.wikipedia.org/wiki/PageRank">PageRank</a> score is calculated for a page. This score is <a href="/site/en.wikipedia.org/wiki/Feature_scaling">scaled</a> from 0 to 1,000,000
A <a href="https://en.wikipedia.org/wiki/PageRank" rel="nofollow" class="external text">PageRank</a> score is calculated for a page. This score is <a href="https://en.wikipedia.org/wiki/Feature_scaling" rel="nofollow" class="external text">scaled</a> from 0 to 1,000,000
</li>
<li>
A page-length score is then calculated for the page. This score is <a href="/site/en.wikipedia.org/wiki/Percentile_rank">ranked</a> from 0 to 1,000,000
A page-length score is then calculated for the page. This score is <a href="https://en.wikipedia.org/wiki/Percentile_rank" rel="nofollow" class="external text">ranked</a> from 0 to 1,000,000
</li>
<li>
The PageRank score is then multiplied by a ratio if it has a low page-length score
@@ -90,7 +90,7 @@
<span class="mw-headline" id="Scaling">Scaling</span>
</h3>
<p>
A simplified definition of scaling is converting a number from one range to another range based on proportion. For a more thorough definition, see <a href="/site/en.wikipedia.org/wiki/Feature_scaling">the Wikipedia page on feature scaling</a>
A simplified definition of scaling is converting a number from one range to another range based on proportion. For a more thorough definition, see <a href="https://en.wikipedia.org/wiki/Feature_scaling" rel="nofollow" class="external text">the Wikipedia page on feature scaling</a>
</p>
<p>
For example, let's say you have a score of 100 in a range of 0 to 400 and want to scale it to 0 to 1000. The following steps would be involved:
@@ -123,7 +123,7 @@ newScore = \frac{oldScore - \text{min}(oldRange)} {\text{max}(oldRange)-\text{mi
<span class="mw-headline" id="Ranking">Ranking</span>
</h3>
<p>
A simplified definition of ranking is assigning a number based on its order in a population of numbers. For those familiar with a school setting, this is "grading on a curve". For a more thorough definition, see <a href="/site/en.wikipedia.org/wiki/Percentile_rank">the Wikipedia page on percentile ranks</a>
A simplified definition of ranking is assigning a number based on its order in a population of numbers. For those familiar with a school setting, this is "grading on a curve". For a more thorough definition, see <a href="https://en.wikipedia.org/wiki/Percentile_rank" rel="nofollow" class="external text">the Wikipedia page on percentile ranks</a>
</p>
<p>
For example, let's say you have the following:
@@ -219,7 +219,7 @@ newScore = \frac{oldScore - \text{min}(oldRange)} {\text{max}(oldRange)-\text{mi
<span class="mw-headline" id="PageRank">PageRank</span>
</h3>
<p>
The basis of XOWA's page score is <a href="/site/en.wikipedia.org/wiki/PageRank">PageRank</a>.
The basis of XOWA's page score is <a href="https://en.wikipedia.org/wiki/PageRank" rel="nofollow" class="external text">PageRank</a>.
</p>
<p>
In brief, PageRank will give high scores to pages which are:
@@ -233,7 +233,7 @@ newScore = \frac{oldScore - \text{min}(oldRange)} {\text{max}(oldRange)-\text{mi
</li>
</ol>
<p>
Note that #2 is recursive (a page will have a high score only if it is linked to by many pages). For more info, a good starting point is <a href="/site/en.wikipedia.org/wiki/PageRank">the Wikipedia page on PageRank</a>.
Note that #2 is recursive (a page will have a high score only if it is linked to by many pages). For more info, a good starting point is <a href="https://en.wikipedia.org/wiki/PageRank" rel="nofollow" class="external text">the Wikipedia page on PageRank</a>.
</p>
<p>
After XOWA calculates the PageRank, XOWA then scales this score in a range of 0 to 1,000,000

View File

@@ -59,13 +59,13 @@
For example, after searching for Earth, the url string will read:
</p>
<p>
<code><a href="/site/en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=0">http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=0</a></code>
<code><a href="http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=0" rel="nofollow" class="external free">http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=0</a></code>
</p>
<p>
If you want to hide redirects, change this to:
</p>
<p>
<code><a href="/site/en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=">http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=</a><span style="color:red;">1</span></code>
<code><a href="http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=" rel="nofollow" class="external free">http://en.wikipedia.org/wiki/Special:Allpages?from=Earth&amp;namespace=0&amp;hideredirects=</a><span style="color:red;">1</span></code>
</p>
<p>
If you want to search in a different namespace, you have two options