Google Mail Calendar Documents Reader Web more »
Recently Visited Groups | Help | Sign in
Google Groups Home
Message from discussion Readability of html

View Parsed - Show only message text

MIME-Version: 1.0
Received: by 10.150.11.14 with SMTP id 14mr166496ybk.25.1222075309231; Mon, 22 
	Sep 2008 02:21:49 -0700 (PDT)
Date: Mon, 22 Sep 2008 02:21:49 -0700 (PDT)
X-IP: 121.223.177.177
User-Agent: G2/1.0
X-HTTP-UserAgent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10_4_11; en) 
	AppleWebKit/525.18 (KHTML, like Gecko) Version/3.1.2 Safari/525.22,gzip(gfe),gzip(gfe)
Message-ID: <8c4e15ff-20dd-4683-8ed5-f1dcf5fc81a0@k36g2000pri.googlegroups.com>
Subject: Readability of html
From: Joel Nation <joel...@cyberone.com.au>
To: php-text-statistics <php-text-statistics@googlegroups.com>
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

The problem with these readability scores is that they don't take into
consideration the way html works. For instance you very rarely put a
full stop in a heading tag (eg: <h1>Hello.</h1>) but this will affect
most of the scores as that word will now be added to the next sentence
and make it longer then it actually is. And with lots of headings you
actually making the page more readable. Ditto for lists as (atleast at
my work) we don't generally put a full stop after a list item. Running
the scores on one of our pages (http://www.accc.gov.au/content/
index.phtml/itemId/815360) I initially get a Flesch Kincaid Grade
Level of 24.9 (if I run it over the entire page) and 18.2 if I strip
the html tags out. But I get a much better score of 8.3 if I add full
stops after the correct tags before stripping the tags out. Of course
running it over just the content I start with a reading level of 11,
but I still end up with the 8.3 I add the full stops. I would suggest
that the code should be modified to take this into consideration. It's
only a few lines of extra code and I'm happy to check in my changes to

Create a group - Google Groups - Google Home - Terms of Service - Privacy Policy
©2009 Google