<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<rss version="2.0">
  <channel>
  <title>php-text-statistics Google Group</title>
  <link>http://groups.google.co.uk/group/php-text-statistics</link>
  <description>A group for the php-text-statistics project on Google Code at http://code.google.com/p/php-text-statistics/</description>
  <language>en-GB</language>
  <item>
  <title>Interesting tool!</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/47f62c8f44fad2f0/9a015b980ced7bf0?show_docid=9a015b980ced7bf0</link>
  <description>
  I&#39;m interested in building a module for Drupal that automatically &lt;br&gt; reports on the readability of the text that&#39;s been entered. I had run &lt;br&gt; across some command line tools, but doing it all in PHP sounds much &lt;br&gt; easier to adopt. &lt;br&gt; &lt;p&gt;This is part of a larger accessibility evaluation that we&#39;re doing, &lt;br&gt; but I wanted to just thank you for contributing this code to the
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/47f62c8f44fad2f0/9a015b980ced7bf0?show_docid=9a015b980ced7bf0</guid>
  <author>
  mike.giff...@gmail.com
  (Mike Gifford)
  </author>
  <pubDate>Fri, 10 Oct 2009 15:14:55 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability Grades</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/30f4ef6c1bc9f0c2?show_docid=30f4ef6c1bc9f0c2</link>
  <description>
  It depends. It depends on the reading grade score you use. If we take &lt;br&gt; the Flesch-Kincaid reading grade level then this gives us the level of &lt;br&gt; schooling that a reader would require if they were to read your text. &lt;br&gt; Ie: if you get a reading grade level of 7 then this means that the &lt;br&gt; reader would need to have an education level equivalent to year 7
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/30f4ef6c1bc9f0c2?show_docid=30f4ef6c1bc9f0c2</guid>
  <author>
  joel...@cyberone.com.au
  (Joel Nation)
  </author>
  <pubDate>Sun, 09 Sep 2009 10:40:22 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability Grades</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/374392b8ad5bf03e?show_docid=374392b8ad5bf03e</link>
  <description>
  Good day , as far as the readability formulas are concern, they are just tools to measure or predict how comprehensible a &amp;quot;technical text&amp;quot; might be. &lt;br&gt; With regards to literary pieces such as prose or poems, applying readability formulas could be such a waste why? &lt;br&gt; Youre the apple of my eye , and you are beautiful to me may mean the same thing, when using the readability formulas, word, sentence and paragraph length are considered therefore making prose and poem somehow an exception due to the meaning they have regardless of how long or short it could be.
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/374392b8ad5bf03e?show_docid=374392b8ad5bf03e</guid>
  <author>
  ac3bu...@yahoo.com
  (merald ACE)
  </author>
  <pubDate>Sun, 09 Sep 2009 01:51:39 UT
</pubDate>
  </item>
  <item>
  <title>Readability Grades</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/0c3de52091496882?show_docid=0c3de52091496882</link>
  <description>
  When writing short stories or novels how valuable is readability &lt;br&gt; grades when assessing the quality of your writing? Without going into &lt;br&gt; grammar correction programs. Plus when judging assessment levels &lt;br&gt; concerning Average grade levels, what is considerer an expectable &lt;br&gt; Average grade level? I have read volumes, regarding Average grade
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/1370fd066d43e938/0c3de52091496882?show_docid=0c3de52091496882</guid>
  <author>
  tony.tarrow.arrowsmi...@gmail.com
  (sooty)
  </author>
  <pubDate>Thu, 09 Sep 2009 11:56:41 UT
</pubDate>
  </item>
  <item>
  <title>Re: Multi-byte string functions</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bd868b04c15348c8/814c865eadbd271e?show_docid=814c865eadbd271e</link>
  <description>
  I wrapped function_exists(&#39;mb_*&#39;) checks around each of the sections &lt;br&gt; where a multi-byte function called. Is this the reason why there are &lt;br&gt; try catch blocks wrapped around them? If so, I can remove the try &lt;br&gt; catch blocks as they didn&#39;t work to avoid the case where the mb_* &lt;br&gt; functions were not available.
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bd868b04c15348c8/814c865eadbd271e?show_docid=814c865eadbd271e</guid>
  <author>
  tpwal...@gmail.com
  (TomW)
  </author>
  <pubDate>Sun, 03 Mar 2009 10:49:35 UT
</pubDate>
  </item>
  <item>
  <title>Multi-byte string functions</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bd868b04c15348c8/61b72b2b8793916b?show_docid=61b72b2b8793916b</link>
  <description>
  The PHP installation does not have the multi-byte string functions &lt;br&gt; enabled by default. Is there a way to determine if they are enabled &lt;br&gt; and then use them, and otherwise skip straight to the built-in strlen?
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bd868b04c15348c8/61b72b2b8793916b?show_docid=61b72b2b8793916b</guid>
  <author>
  tpwal...@gmail.com
  (TomW)
  </author>
  <pubDate>Sun, 03 Mar 2009 10:36:17 UT
</pubDate>
  </item>
  <item>
  <title>Integration with HTML-Kit</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bda0b49da8527c00/13024da90828e29c?show_docid=13024da90828e29c</link>
  <description>
  Hello, &lt;br&gt; &lt;p&gt;For background, I regularly use an editor known as HTML-Kit (http:// &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://www.htmlkit.com/&quot;&gt;[link]&lt;/a&gt;) for web site work. It is extensible through a wide- &lt;br&gt; ranging plugin system. On the support newsgroups a couple of weeks &lt;br&gt; ago, there was a question about writing some sort of readability &lt;br&gt; statistics plugin. It has been a few years since the last time I wrote
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/bda0b49da8527c00/13024da90828e29c?show_docid=13024da90828e29c</guid>
  <author>
  tpwal...@gmail.com
  (TomW)
  </author>
  <pubDate>Tue, 03 Mar 2009 23:19:37 UT
</pubDate>
  </item>
  <item>
  <title>Readability Questions</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5feaf6ab0e388ed0/bfd75f2d6ce1c7ae?show_docid=bfd75f2d6ce1c7ae</link>
  <description>
  Thank you so much Mr. David Child for that prompt response, Im still somehow confused. Now that you mention it. Does it affect the credibility? Does it no longer necessarily mean that the shorter the number of words used, the higher the grade will be? My defense is approaching and I&#39;m looking into what i can consider loophole with our study regarding readability statistics. What are the implications of your explanation about the average syllables? What could be the best suggestion in order for us to have a higher grade. Thank You so much and Good Day to all.
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5feaf6ab0e388ed0/bfd75f2d6ce1c7ae?show_docid=bfd75f2d6ce1c7ae</guid>
  <author>
  ac3bu...@yahoo.com
  (merald ACE)
  </author>
  <pubDate>Fri, 01 Jan 2009 16:25:10 UT
</pubDate>
  </item>
  <item>
  <title>Re: Grading</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/719b4e4c9ae2f3c3/a5aa3c6d880b4b97?show_docid=a5aa3c6d880b4b97</link>
  <description>
  &amp;quot;The quick brown fox jumped over the lazy dog.&amp;quot; - 94.3 &lt;br&gt; &amp;quot;The lazy dog was jumped over by the quick brown fox.&amp;quot; - 95.7 &lt;br&gt; &lt;p&gt;The difference is the addition of the two words &amp;quot;was&amp;quot; and &amp;quot;by&amp;quot;. &lt;br&gt; Although the second sentence is slightly longer, the shorter words &lt;br&gt; help reduce the average syllable count, which is why the scores are
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/719b4e4c9ae2f3c3/a5aa3c6d880b4b97?show_docid=a5aa3c6d880b4b97</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Fri, 01 Jan 2009 10:02:06 UT
</pubDate>
  </item>
  <item>
  <title>Grading</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/719b4e4c9ae2f3c3/24e4a6cdedcd4bd8?show_docid=24e4a6cdedcd4bd8</link>
  <description>
  Hi guys. i know it might somehow sound odd but i thought , based on &lt;br&gt; the rules of readability . the active voice which has fewer words than &lt;br&gt; passive ones .. But when i checked the phrase The quick brown fox jump &lt;br&gt; over the lazy dog and &amp;quot; The lazy dog, was jumped over by the quick &lt;br&gt; brown fox.&amp;quot; the latter rendered a higher score. Please help me.. This
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/719b4e4c9ae2f3c3/24e4a6cdedcd4bd8?show_docid=24e4a6cdedcd4bd8</guid>
  <author>
  ac3bu...@gmail.com
  (ac3buddy@gmail.com)
  </author>
  <pubDate>Fri, 01 Jan 2009 01:45:41 UT
</pubDate>
  </item>
  <item>
  <title>Re: translated to Ruby</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/3836feb6024b63cf?show_docid=3836feb6024b63cf</link>
  <description>
  My pleasure. &lt;br&gt; &lt;p&gt;I couldn&#39;t figure out how to translate that one line in the clean text &lt;br&gt; function: &lt;br&gt; (&#39;$matches&#39;, &#39;return strtolower($matches[0]);&#39;), $strText); // Lower &lt;br&gt; case all words following terminators (for gunning fog score) &lt;br&gt; &lt;p&gt;So I guess that&#39;s the reason why my Gunning-Fog is off a bit. &lt;br&gt; &lt;p&gt;Would you mind adding these two files to the repository? I won&#39;t have any
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/3836feb6024b63cf?show_docid=3836feb6024b63cf</guid>
  <author>
  kapel...@gmail.com
  (Adam Kapelner)
  </author>
  <pubDate>Wed, 01 Jan 2009 18:46:37 UT
</pubDate>
  </item>
  <item>
  <title>Re: translated to Ruby</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/403ff36546faf68b?show_docid=403ff36546faf68b</link>
  <description>
  Great work Adam!
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/403ff36546faf68b?show_docid=403ff36546faf68b</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Wed, 01 Jan 2009 10:02:25 UT
</pubDate>
  </item>
  <item>
  <title>translated to Ruby</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/4b8ce6b3fd033445?show_docid=4b8ce6b3fd033445</link>
  <description>
  Hello all, &lt;br&gt; &lt;p&gt;I&#39;ve translated the php-text-statistics package to Ruby, you can view &lt;br&gt; the files below. Please note I couldn&#39;t get the Gunning Fog Score to &lt;br&gt; work 100% &lt;br&gt; &lt;p&gt;Regards, &lt;br&gt; Adam &lt;br&gt; &lt;p&gt;require &#39;collections/sequenced_hash&#39; &lt;br&gt; &lt;p&gt;module ReadabilityIndices &lt;br&gt; &lt;p&gt; class Readability &lt;br&gt; &lt;p&gt; NumDecimalPlaces = 1
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/5a9e7aea30d7a456/4b8ce6b3fd033445?show_docid=4b8ce6b3fd033445</guid>
  <author>
  kapel...@gmail.com
  (way4thesub)
  </author>
  <pubDate>Tue, 01 Jan 2009 23:16:45 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/7c9b5f5050166d22?show_docid=7c9b5f5050166d22</link>
  <description>
  And I completely forgot to actually reply to the bulk of your message, &lt;br&gt; Joel. Sorry about that - was distracted by bacon :) &lt;br&gt; I think that&#39;s a great idea - identifying places for improvements, and &lt;br&gt; highlighting difficult words, would make a really useful tool. Making &lt;br&gt; blanket suggestions is a good start, and a synonym mashup would be
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/7c9b5f5050166d22?show_docid=7c9b5f5050166d22</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Sat, 10 Oct 2008 13:04:38 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/f782dd9c473ec3c0?show_docid=f782dd9c473ec3c0</link>
  <description>
  I&#39;ve run the unit tests and your changes work fine on the test text - &lt;br&gt; great stuff, Joel. &lt;br&gt; It would be useful to have some test HTML to run unit tests against. &lt;br&gt; I&#39;ll start putting some together. I&#39;m also trying to sort out the &lt;br&gt; Dale-Chall and Spache unit tests. I&#39;ve added the word lists to the &lt;br&gt; repository already so others can have a play with them if they want.
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/f782dd9c473ec3c0?show_docid=f782dd9c473ec3c0</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Sat, 10 Oct 2008 13:00:07 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/2d3985e19c6476ea?show_docid=2d3985e19c6476ea</link>
  <description>
  I agree, they do look a little dodgey (especially when it&#39;s really &lt;br&gt; random numbers), but they have all been tested and there relative &lt;br&gt; effectiveness has been ranked (dale-chall being the best one I know &lt;br&gt; of). I think we can use computers in another way - to suggest how to &lt;br&gt; improve the text. Since most of them rely on sentence length, the
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/2d3985e19c6476ea?show_docid=2d3985e19c6476ea</guid>
  <author>
  joel...@cyberone.com.au
  (Joel Nation)
  </author>
  <pubDate>Thu, 10 Oct 2008 06:53:37 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/0a7b6ef9999b86a7?show_docid=0a7b6ef9999b86a7</link>
  <description>
  Hi Joel, &lt;br&gt; Great work. Will run tests against PHPUnit when at home later, but all &lt;br&gt; looks fine. &lt;br&gt; I&#39;ve been working, sporadically, on a few of the other various &lt;br&gt; readability scores, including Spache and Dale-Chall. Would be great to &lt;br&gt; see what you&#39;ve come up with for Dale-Chall so far. &lt;br&gt; Some of the readability scores are decidedly ropey, I&#39;ve come to
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/0a7b6ef9999b86a7?show_docid=0a7b6ef9999b86a7</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Tue, 10 Oct 2008 10:27:20 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/8560339e910640e8?show_docid=8560339e910640e8</link>
  <description>
  Okay I checked in my first changes. This covers all the HTML tags we &lt;br&gt; use at my work that should have a full stop in front of them. There &lt;br&gt; may be a couple of others, but this should cover the vast majority of &lt;br&gt; HTML use. I didn&#39;t use a preg_replace, more comfortable out of the &lt;br&gt; world of regexps! I don&#39;t have PHP4, but I&#39;ll check in a PHP4 version
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/8560339e910640e8?show_docid=8560339e910640e8</guid>
  <author>
  joel...@cyberone.com.au
  (Joel Nation)
  </author>
  <pubDate>Tue, 10 Oct 2008 10:00:58 UT
</pubDate>
  </item>
  <item>
  <title>Re: Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/1e84c4cd4a03bedd?show_docid=1e84c4cd4a03bedd</link>
  <description>
  Hi Joel, &lt;br&gt; Good points all. I&#39;ve added you as a member to the project at &lt;br&gt; &lt;a target=&quot;_blank&quot; rel=nofollow href=&quot;http://code.google.com/p/php-text-statistics/&quot;&gt;[link]&lt;/a&gt; - you should be able to &lt;br&gt; commit code now. Looking forward to seeing your additions! &lt;br&gt; Dave
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/1e84c4cd4a03bedd?show_docid=1e84c4cd4a03bedd</guid>
  <author>
  d...@addedbytes.com
  (David Child)
  </author>
  <pubDate>Thu, 09 Sep 2008 18:17:32 UT
</pubDate>
  </item>
  <item>
  <title>Readability of html</title>
  <link>http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/71aa245bfbbd9571?show_docid=71aa245bfbbd9571</link>
  <description>
  The problem with these readability scores is that they don&#39;t take into &lt;br&gt; consideration the way html works. For instance you very rarely put a &lt;br&gt; full stop in a heading tag (eg: &amp;lt;h1&amp;gt;Hello.&amp;lt;/h1&amp;gt;) but this will affect &lt;br&gt; most of the scores as that word will now be added to the next sentence &lt;br&gt; and make it longer then it actually is. And with lots of headings you
  </description>
  <guid isPermaLink="true">http://groups.google.co.uk/group/php-text-statistics/browse_frm/thread/8d6a53b202a96caa/71aa245bfbbd9571?show_docid=71aa245bfbbd9571</guid>
  <author>
  joel...@cyberone.com.au
  (Joel Nation)
  </author>
  <pubDate>Mon, 09 Sep 2008 09:21:49 UT
</pubDate>
  </item>
  </channel>
</rss>
