Web word count

Ms Hollands

Cow lover
Super Member
Registered
Joined
Aug 15, 2008
Messages
1,151
Reaction score
135
Location
La Clusaz, France
Website
www.lefrancophoney.com
Is there any way I can calculate how many words are published on a website, not including all the HTML tagging etc.? I've been offered some proofing work on a very large website, but it's massive and I fear it will take some time to figure out.

So far, the best method I can think of is to copy and paste the text into one big MS Word document and do a word count on it, but is there a faster way? There are a ridiculous number of pages going on...
 

Priene

Out to lunch
Super Member
Registered
Joined
Aug 25, 2007
Messages
6,422
Reaction score
879
I would almost certainly write a program to do this, as I couldn't find much promising freeware that already has this function.

You could copy each web page into word and then use wildcard replace to get rid of anything in a tag: all strings starting with '<' and ending with '>'. Other than that, I'm having difficulty thinking of an easy solution.
 

felipe

Registered
Joined
Mar 6, 2009
Messages
41
Reaction score
1
Location
Costa Rica
Website
intribu.org
There are a number of tools that use a javascript to count the words, but they would work the same way as a Word. You would still have to copy/paste everything.
If the text is long, it will take a while.
When you say website, you mean the entire website?
If this is the case, it really is a hard work, because most modern websites store most of their information on a database, you would have to generate each page, download it and then count it.
An application for that, sure would be a handy one.
 

Ms Hollands

Cow lover
Super Member
Registered
Joined
Aug 15, 2008
Messages
1,151
Reaction score
135
Location
La Clusaz, France
Website
www.lefrancophoney.com
It's a website based on php, so I loaded each page, copied and pasted into MS Word, and word counted from that.

It only took about 10 minutes in the end as I'm fast with my keyboard shortcuts :O)