Word Count: Difference between revisions

From Librivox wiki
Jump to navigationJump to search
Content deleted Content added
→‎Gutencount (word counter): Bonus - count words from search results page
Msfry (talk | contribs)
→‎Websites: added "ype or paste in your text; "
 
(27 intermediate revisions by 2 users not shown)
Line 1: Line 1:
Here are a number of useful tools to help both readers and book coordinators get accurate word counts.
LibriVox member ''peegee'' has written a script for web browsers which may make the BC's job of compiling word counts for the Magic Window a little easier.


== How it Works ==
== Websites ==
Below are various websites that provide an interface to count the number of words in copy-pasted text. No word limits to what you can paste in, but partial word count of highlighted portions of text is either not available, or requires scrolling to find. All identify the top 10 keywords in the text, check spelling, and to accomodate writers, give a character count useful for typing responses in Twitter, Facebook, etc.
The script runs against the HTML ebooks on [http://www.gutenberg.org/wiki/Main_Page Project Gutenberg].
# you click the paragraph where you want to start the count,
# it asks you the target number of words,
# it quickly goes through every paragraph from that point onwards and counts the words and the running total
# it stops when it reaches the target, or the end of the chapter if before
# the page is temporarily changed to display the word counts right there at the end of each paragraph
# you can repeat this as many times as you like, each time you click a paragraph the temporary page changes are removed


* [https://chrome.google.com/webstore/detail/word-counter-plus/] Word Counter Plus, extension installs in Edge, Chrome, Brave and other browsers upon Get. Counts words and characters in highlighted segment, in a hover over window.
== Screenshots ==
* [https://wordcounter.net/ wordcounter.net] counts highlighted segments, but word count is displayed at bottom.
Here are a few screenshots to illustrate the process:
* [https://wordcounter.io/ wordcounter.io] type or paste in your text; highlighted portions not counted
* [https://thewordcounter.com/ thewordcounter.com] type or paste in your text; highlighted portions not counted
* [https://easywordcount.com/ easywordcount.com] type or paste in your text; highlighted portions not counted
* [https://wordstotime.com/ wordstotime.com] gives an estimated reading time (depending on your speaking speed -- slow, medium, fast) as well as word count.


== Document Editors ==
<gallery>
Image:Wordcount1.png|The first prompt
Image:Wordcount2.png|Choose target number
Image:Wordcount3.png|The count displayed
</gallery>


Most document editors (Microsoft Word, Google Docs, LibreOffice, etc) and some basic text editors have a built-in word counting feature. Copy and paste your text into a document, select the text you want to count.
== Installing the Script ==
The method of installation depends on the browser (Firefox and Chrome may need to be re-started after installation, Opera does not):


* Microsoft Word
=== Firefox ===
** '''Review &rarr; Word Count'''
# [https://addons.mozilla.org/en-US/firefox/addon/748 Greasemonkey] You will first need the GreaseMonkey add-on for Firefox which is available from this link.
** MS Word will also automatically display the word count on the bottom status bar, unless this feature has been disabled. To re-enable it, right-click on the bottom status bar and tick the Word Count option. Just highlight the words you want counted, either part or all of the text. Wait a moment and the word count will display (lower left corner) This is a powerful program, great for dividing up whole books into sections, visually dividing long chapters into approximately equal segments, etc. Video tutorial to be added soon.
# [http://userscripts-mirror.org/scripts/show/69969 Firefox Install] Once you have GreaseMonkey installed, go here and click on the '''Install''' button
* LibreOffice Writer
** '''Tools &rarr; Word Count'''
** A running word count is displayed on the bottom status bar. The status bar cannot be modified but the entire bar may be toggled hidden or displayed by selecting '''View &rarr; Status Bar'''
* Google Docs
** '''Tools &rarr; Word count'''
** Keyboard shortcut: '''Ctrl + Shift + C'''


== Web Browser Bookmarklets ==
=== Google Chrome ===
#[https://chrome.google.com/webstore/detail/tampermonkey/dhdgffkkebhmkfjojejmpbldmpobfkfo?hl=en TamperMonkey] First you'll need to install TamperMonkey from the Chrome store.
#[http://userscripts-mirror.org/scripts/show/70234 WordCount] Then, you'll be able to install the word count extension from here.


These bookmarklets can be used with most modern web browsers. The code for these scripts can be found by following the links below, or you can find them as Tampermonkey/Greasemonkey scripts on [https://greasyfork.org GreasyFork.org].
=== Opera ===
# To enable User JavaScript, use Tools > Preferences > Advanced > Content > JavaScript options, and select the directory where you will put your User JavaScript files (probably best if its a new folder with nothing else in it).
# [http://userscripts-mirror.org/scripts/show/70234 the script] then go to the script
# click the Install button to get the script in a new tab in Opera.
# from the Opera menu click ''File > Save As'' to save it into the folder you chose in the first step (probably best to just keep the suggested file-name - whatever name you choose it MUST end in .user.js )


=== Gutencount (word counter) ===
== Limitations ==
The script only works on the Gutenberg online HTML books, not the ''text'' or ''zipped HTML'', or other formats.

== Support ==
If you have any problems installing or using this script you can either post a message in [http://forum.librivox.org/viewtopic.php?t=24437 this forum thread] or send a Private Message to ''peegee''.


<u>Get the script here</u>: [https://vox.quartertone.net/#gc Gutencount on vox.quartertone.net].<br/>
== Other Alternatives ==
Also available as a [https://www.tampermonkey.net Tampermonkey]/Greasemonkey script, which can be found here: [https://greasyfork.org/en/scripts/463785-gutencount Gutencount on GreasyFork.org]<br/>
(The Tampermonkey version will not activate on non-Gutenberg sites)


Here are some [https://www.youtube.com/playlist?list=PLWMFM-KiRM4MXGQsT0nArfrxTZKuUDlgG tutorial videos] on how to use this script.
=== Web Browser Bookmarklets ===


This is a "universal" word counter bookmarklet. With a single click, you can get an accurate word count for the body of a Gutenberg book.
==== Gutencount (word counter) ====
This is a "universal" word counter bookmarklet. The code below will, with a single click, return the word count for the body of a Gutenberg book.


* If you select some text, it will count the words in the selection.
* It counts the ebook text, excluding the Gutenberg disclaimer and legalese (everything between "*** START OF _____ ***" and "*** END OF ____ ***"). This will work whether you are on the [https://www.gutenberg.org/ebooks/33843 main page], the [https://www.gutenberg.org/ebooks/33843.html.images HTML page] (or [https://www.gutenberg.org/files/33843/33843-h/33843-h.htm as-submitted]), or on the [https://www.gutenberg.org/ebooks/33843.txt.utf-8 plain text page].
* On a normal webpage, it will count all the text on the page
** Slight discrepancies may be present due to extra text (Transcriber's notes, Book summary, subtitles, etc) in some formats that are not present in the others.
* In a Guterberg ebook page it will count the ebook text, excluding the Gutenberg disclaimer and legalese.
:This will work whether you are on the main page, the HTML page (or as-submitted), or on the plain text page.
:* Slight discrepancies may be present due to extra text (Transcriber's notes, Book summary, subtitles, etc) in some formats that are not present in the others.
* '''''BONUS''''': On the Gutenberg search results page, it will append the word count to all search results! (So, if you're looking for a short short story, you don't have to go counting every book that comes up in search.)
* '''''BONUS''''': On the Gutenberg search results page, it will append the word count to all search results! (So, if you're looking for a short short story, you don't have to go counting every book that comes up in search.)
* Update(2023-04-11): Word/character count will now be displayed in a fixed box in the upper right corner of the window, instead of a pop-up alert. Double click to dismiss.
* BONUS: On a normal webpage, it will count all the text on the page.
* '''NEW(2023-04-14)''': When in the HTML page of an ebook, click on the chapter heading to get a word count for that chapter.
* BONUS: If any text is highlighted, it will count the words in the selection.


=== Chapter Counter (beta) ===
<nowiki>javascript:(function(){let e=document.location.toString(),t=window.getSelection()+"";function wx(e,t,n=""){return wc(e.match(/(?<=\*{3} START.*?\*{3}).*(?=\*{3} END.*?\*{3})/s)[0].trim(),t,n)}function wc(e,t,n=""){let o=e.trim().match(/[\*\S]+/g).length;if(n)return o;alert(o+" words, "+e.length+" chars\nin "+t)}function fwx(e,t=""){fetch(e).then((e=>e.text())).then((e=>{let n=wx(e,"ebook",t);t&&(t.children[1].innerHTML+="wc:"+n)})).catch((e=>alert(e)))}if(t.length>0)wc(t,"selection");else if(e.match(/gutenberg.org\/(files|cache\/epub)\/\d+/))wx(document.body.innerText,"ebook");else if(e.match(/gutenberg.org\/ebooks\/(\d+)/))fwx(document.location+".txt.utf-8","ebook");else if(e.match(/gutenberg.org\/ebooks\/search/))for(const e of document.querySelectorAll(".booklink>a"))fwx(e.href+".txt.utf-8",e);else wc(document.body.innerText,"webpage")})();</nowiki>


<u>Get the script here</u>: [https://vox.quartertone.net/#cc ChapterCounter on vox.quartertone.net].
==== Chapter Counter (beta) ====


This script will (attempt to) count the number of words in indexed chapters of a Gutenberg book.
This script counts the number of words in indexed chapters of a Gutenberg book. Navigate to the HTML or the plain text page, and activate the script.
Navigate to the HTML or the plain text page, and activate the script.


Current Limitations:
Current Limitations:
Line 68: Line 60:
* Does not work if the TOC is formatted strangely (eg page numbers or other text interspersed between the chapter titles).
* Does not work if the TOC is formatted strangely (eg page numbers or other text interspersed between the chapter titles).


For questions or issues with either of the above scripts, please post in [https://forum.librivox.org/viewtopic.php?t=96792 this forum thread], or send a Private Message to [https://forum.librivox.org/ucp.php?i=pm&mode=compose&u=141665 ''quartertone''].
<nowiki>javascript:(function(){if(document.location.toString().match(/gutenberg.org\/(files|cache\/epub)\/\d+/)){let i=document.body.innerText.match(/(?<=\*{3} START.*?\*{3}).*(?=\*{3} END.*?\*{3})/s)[0].trim().split(/\n+/),o=[],c="",r="",d={};for(var t of i)if((t=t.trim())&&0!=t.length)if(c||!t.match(/^contents.?$/i)){if("contents"==c)o.length>0&&n(o[0],t)?(c="body",d[r=o[0]]=""):o.push(t);else if("body"==c){if(void 0!==o[o.indexOf(r)+1]&&n(t,o[o.indexOf(r)+1])){d[r=o[o.indexOf(r)+1]]="";continue}d[r]+=t+" "}}else c="contents";var e=document.getElementById("xcount")||document.createElement("div");e.id="xcount",e.style="position:fixed;top:0;right:0;width:20em;height:20em;overflow-y:scroll;background:#333c;color:#fff;",e.innerHTML="",document.body.appendChild(e);for(const t in d)e.innerHTML+=t+": "+d[t].trim().match(/[\*\S]+/g).length+" words<br/><br/>"}function n(t,e){return!(!t||!e)&&(t==e||(!(!t.match(/^chapter.*\./i)||t.match(/^chapter.*\./i)[0]!=e.match(/^chapter.*\./i)[0])||void 0))}})();</nowiki>


=== Google Chrome Counter ===


The link to the extension in the Google Chrome store:
For questions or issues with either of the above scripts, please post in [https://forum.librivox.org/viewtopic.php?t=96792 this forum thread], or send a Private Message to [https://forum.librivox.org/ucp.php?i=pm&mode=compose&u=141665 ''quartertone''].
https://chrome.google.com/webstore/detail/word-counter-plus/fpjegfbcdijjfkceenlfoehpcakfgldj


Click "Add to Chrome". Once installed (you may need to restart your browser after adding it), highlight the text you want the word count for and right-click. There should be a Word Counter Plus option in the right click menu. Once selected, it will display the word count for the selected text.
Note: The scripts above have been ''minified''. To view the scripts in a human-readable format, please visit [https://vox.quartertone.net vox.quartertone.net].


== Gutenberg Progressive word counter ==


This script is adapted from PeeGee's Gutenberg word counting script. The original version of the script was no longer functioning, so this de-bugged version was created. Information about the previous version of this script can be found archived at: [[PeeGee's Script]].
=== Websites ===
Various websites provide an interface to count the number of words in copy-pasted text. Below are some sites that provide accurate word counts:


=== How it Works ===
* [https://wordcounter.net wordcounter.net]
The script runs against the HTML ebooks on [http://www.gutenberg.org/wiki/Main_Page Project Gutenberg].
* [https://wordcounter.io/ wordcounter.io]
# you click the paragraph where you want to start the count,
* [https://thewordcounter.com/ thewordcounter.com]
# it asks you the target number of words,
* [https://easywordcount.com/ easywordcount.com]
# it quickly goes through every paragraph from that point onwards and counts the words and the running total
# it stops when it reaches the target, or the end of the chapter if before
# the page is temporarily changed to display the word counts right there at the end of each paragraph
# you can repeat this as many times as you like, each time you click a paragraph the temporary page changes are removed


=== Screenshots ===
Here are a few screenshots to illustrate the process:


<gallery>
=== Document Editors ===
Image:Wordcount1.png|The first prompt
Image:Wordcount2.png|Choose target number
Image:Wordcount3.png|The count displayed
</gallery>


=== Installing the Script ===
Most document editors (Microsoft Word, Google Docs, LibreOffice, etc) and some basic text editors have a built-in word counting feature. Copy and paste your text into a document, select the text you want to count.


* This script requires the TamperMonkey browser extension which is available for most modern browsers.
* Microsoft Word
** [https://www.tampermonkey.net Get TamperMonkey] for your browser.
** Review &rarr; Word Count
* Instructions for installing the extension can be found in the GutenCount tutorial playlist above.
** MS Word will also automatically display the word count on the bottom status bar, unless this feature has been disabled. To re-enable it, right-click on the bottom status bar and tick the Word Count option.
** YouTube: [https://youtu.be/XeBnn6ma-5Y Installing Tampermonkey/Gutencount].
* Google Docs
* Once Tampermonkey is installed, go to the Greasyfork page for the script:
** Tools &rarr; Word count
** Install [https://greasyfork.org/en/scripts/464790 Gutenberg Progressive Word Count].
** Keyboard shortcut: '''Ctrl + Shift + C'''
** Install the script by clicking the green button that says "Install this script".
* LibreOffice Writer

** Tools &rarr; Word Count

=== Limitations ===
The script only works on the Gutenberg online HTML books, not the ''text'' or ''zipped HTML'', or other formats.


=== Support ===
If you have any problems installing or using this script you can either post a message in [https://forum.librivox.org/viewtopic.php?t=96792 this forum thread], or send a Private Message to [https://forum.librivox.org/ucp.php?i=pm&mode=compose&u=141665 ''quartertone''].

Latest revision as of 18:11, 11 July 2023

Here are a number of useful tools to help both readers and book coordinators get accurate word counts.

Websites

Below are various websites that provide an interface to count the number of words in copy-pasted text. No word limits to what you can paste in, but partial word count of highlighted portions of text is either not available, or requires scrolling to find. All identify the top 10 keywords in the text, check spelling, and to accomodate writers, give a character count useful for typing responses in Twitter, Facebook, etc.

  • [1] Word Counter Plus, extension installs in Edge, Chrome, Brave and other browsers upon Get. Counts words and characters in highlighted segment, in a hover over window.
  • wordcounter.net counts highlighted segments, but word count is displayed at bottom.
  • wordcounter.io type or paste in your text; highlighted portions not counted
  • thewordcounter.com type or paste in your text; highlighted portions not counted
  • easywordcount.com type or paste in your text; highlighted portions not counted
  • wordstotime.com gives an estimated reading time (depending on your speaking speed -- slow, medium, fast) as well as word count.

Document Editors

Most document editors (Microsoft Word, Google Docs, LibreOffice, etc) and some basic text editors have a built-in word counting feature. Copy and paste your text into a document, select the text you want to count.

  • Microsoft Word
    • Review → Word Count
    • MS Word will also automatically display the word count on the bottom status bar, unless this feature has been disabled. To re-enable it, right-click on the bottom status bar and tick the Word Count option. Just highlight the words you want counted, either part or all of the text. Wait a moment and the word count will display (lower left corner) This is a powerful program, great for dividing up whole books into sections, visually dividing long chapters into approximately equal segments, etc. Video tutorial to be added soon.
  • LibreOffice Writer
    • Tools → Word Count
    • A running word count is displayed on the bottom status bar. The status bar cannot be modified but the entire bar may be toggled hidden or displayed by selecting View → Status Bar
  • Google Docs
    • Tools → Word count
    • Keyboard shortcut: Ctrl + Shift + C

Web Browser Bookmarklets

These bookmarklets can be used with most modern web browsers. The code for these scripts can be found by following the links below, or you can find them as Tampermonkey/Greasemonkey scripts on GreasyFork.org.

Gutencount (word counter)

Get the script here: Gutencount on vox.quartertone.net.
Also available as a Tampermonkey/Greasemonkey script, which can be found here: Gutencount on GreasyFork.org
(The Tampermonkey version will not activate on non-Gutenberg sites)

Here are some tutorial videos on how to use this script.

This is a "universal" word counter bookmarklet. With a single click, you can get an accurate word count for the body of a Gutenberg book.

  • If you select some text, it will count the words in the selection.
  • On a normal webpage, it will count all the text on the page
  • In a Guterberg ebook page it will count the ebook text, excluding the Gutenberg disclaimer and legalese.
This will work whether you are on the main page, the HTML page (or as-submitted), or on the plain text page.
  • Slight discrepancies may be present due to extra text (Transcriber's notes, Book summary, subtitles, etc) in some formats that are not present in the others.
  • BONUS: On the Gutenberg search results page, it will append the word count to all search results! (So, if you're looking for a short short story, you don't have to go counting every book that comes up in search.)
  • Update(2023-04-11): Word/character count will now be displayed in a fixed box in the upper right corner of the window, instead of a pop-up alert. Double click to dismiss.
  • NEW(2023-04-14): When in the HTML page of an ebook, click on the chapter heading to get a word count for that chapter.

Chapter Counter (beta)

Get the script here: ChapterCounter on vox.quartertone.net.

This script counts the number of words in indexed chapters of a Gutenberg book. Navigate to the HTML or the plain text page, and activate the script.

Current Limitations:

  • The Table of contents (TOC) must be present, and labeled Contents.
  • TOC should not include any sections that appear before the contents list (eg, Preface).
  • Does not work if chapter headings do not match the TOC (eg, TOC lists Chapter IV, but chapter headings appear as IV).
  • Does not work if the TOC is formatted strangely (eg page numbers or other text interspersed between the chapter titles).

For questions or issues with either of the above scripts, please post in this forum thread, or send a Private Message to quartertone.

Google Chrome Counter

The link to the extension in the Google Chrome store: https://chrome.google.com/webstore/detail/word-counter-plus/fpjegfbcdijjfkceenlfoehpcakfgldj

Click "Add to Chrome". Once installed (you may need to restart your browser after adding it), highlight the text you want the word count for and right-click. There should be a Word Counter Plus option in the right click menu. Once selected, it will display the word count for the selected text.

Gutenberg Progressive word counter

This script is adapted from PeeGee's Gutenberg word counting script. The original version of the script was no longer functioning, so this de-bugged version was created. Information about the previous version of this script can be found archived at: PeeGee's Script.

How it Works

The script runs against the HTML ebooks on Project Gutenberg.

  1. you click the paragraph where you want to start the count,
  2. it asks you the target number of words,
  3. it quickly goes through every paragraph from that point onwards and counts the words and the running total
  4. it stops when it reaches the target, or the end of the chapter if before
  5. the page is temporarily changed to display the word counts right there at the end of each paragraph
  6. you can repeat this as many times as you like, each time you click a paragraph the temporary page changes are removed

Screenshots

Here are a few screenshots to illustrate the process:

Installing the Script

  • This script requires the TamperMonkey browser extension which is available for most modern browsers.
  • Instructions for installing the extension can be found in the GutenCount tutorial playlist above.
  • Once Tampermonkey is installed, go to the Greasyfork page for the script:


Limitations

The script only works on the Gutenberg online HTML books, not the text or zipped HTML, or other formats.


Support

If you have any problems installing or using this script you can either post a message in this forum thread, or send a Private Message to quartertone.