Function Repository Resource:

ReadabilityScore

Source Notebook

Calculate the readability of a text using a standard formula

Contributed by: Jesse Friedman

ResourceFunction["ReadabilityScore"][text]

calculates the Flesch–Kincaid grade level score of text.

ResourceFunction["ReadabilityScore"][text,type]

calculates the readability score of text using readability test type.

ResourceFunction["ReadabilityScore"][{text₁,text₂,…},…]

calculates the readability score of the text_i.

ResourceFunction["ReadabilityScore"][text,{type₁,type₂,…}]

calculates the readability score of text using readability tests type_i.

ResourceFunction["ReadabilityScore"][type]

represents an operator form of ResourceFunction["ReadabilityScore"] that can be applied to an expression.

Details and Options

The following readability tests are supported:

"ARI"

Automated Readability Index grade level »

"ColemanLiau"

Coleman–Liau grade level »

"ColemanLiauCloze"

Coleman-Liau cloze fraction »

"FleschKincaid"

Flesch–Kincaid grade level(default)»

"FleschReadingEase"

Flesch Reading Ease score »

"FORCAST"

FORCAST grade level »

"GunningFog"

Gunning fog index grade level »

"LensearWrite"

Lensear Write index

"Rix"

Interpolated Rix grade level

"RixScore"

Rix score

"SMOG"

SMOG grade level »

ResourceFunction["ReadabilityScore"]["Types"] returns a list of supported readability tests.

For the "ColemanLiauCloze", "FleschReadingEase", and "LensearWrite" tests, higher values indicate higher readability. For all other tests, higher values indicate lower readability.

Grade level indices are meant to estimate the U.S. grade level (or number of years of education, for values greater than 12) generally required to understand the text.

ResourceFunction["ReadabilityScore"][type][text] is equivalent to ResourceFunction["ReadabilityScore"][text,type].

The following option is supported:

"UseWordData"

False

whether to check WordData for syllable hyphenation and phonetic data

Some readability tests require syllable metrics, which are obtained using the resource function WordSyllables. By default, only the built-in syllabic heuristics in WordSyllables are used. Setting "UseWordData" → True will cause WordSyllables to check WordData for hyphenation data, which may be slightly more accurate but generally takes longer.

Examples

Basic Examples (3)

Calculate the Flesch-Kincaid grade level score of Rudyard Kipling’s The Jungle Book:

In[1]:=

Out[1]=

Calculate the Rix score of Charles Darwin’s On the Origin of Species:

In[2]:=

Out[2]=

Calculate the Automated Readability Index grade level score of Stephen Wolfram’s A New Kind of Science:

In[3]:=

Out[3]=

Scope (4)

Calculate the grade level of Lewis Carroll’s Through the Looking-Glass using multiple different readability tests:

In[4]:=

Out[4]=

Compare the Lensear Write scores for two novels involving gabled houses (higher values indicate higher readability):

In[5]:=

ResourceFunction["ReadabilityScore"][{
ResourceData["Anne of Green Gables"],
ResourceData["The House of the Seven Gables"]
}, "LensearWrite"]

Out[5]=

Calculate the Automated Readability Index grade levels of several works of fiction by Sir Arthur Conan Doyle:

In[6]:=

In[7]:=

scores = ResourceFunction["ReadabilityScore"]["ARI"] /@ Association[
books // SortBy[AbsoluteTime@#["SourceMetadata", "Date"] &] // Map[#["Name"] -> ResourceData[#] &]]

Out[7]=

Chart the scores:

In[8]:=

Out[8]=

List the readability score types supported by ReadabilityScore:

In[9]:=

Out[9]=

Options (2)

UseWordData (2)

By default, readability tests which require syllabic metrics are restricted to using the (generally excellent) built-in heuristics in WordSyllables:

In[10]:=

Out[10]=

Setting "UseWordData"→True allows WordSyllables to look up syllable hyphenation in WordData, which may be slightly more accurate but can take significantly longer to access:

In[11]:=

Out[11]=

Neat Examples (1)

Calculate and plot the Flesch-Kincaid grade level indices of American presidents’ inaugural addresses over time:

In[12]:=

addressScores = ResourceData["Presidential Inaugural Addresses"][
GroupBy[Key[
"Date"] -> ({#Name, ResourceFunction["ReadabilityScore"][#Text, "FleschKincaid"]} &)]/*Map[First]/*KeySort];

In[13]:=

DateListPlot[{
addressScores[
TimeSeries[KeyValueMap[{#1, #2[[2]]} &, #]] -> Values[#][[All, 1]] &]
}, Joined -> True, Mesh -> All, PlotStyle -> Dashed, ImageSize -> Large]

Out[13]=

Publisher

Jesse Friedman

Version History

1.0.0 – 22 October 2019

Source Metadata

Citation:
- Kincaid, J. P., Leroy J. D., "Validation of the Automated Readability Index: A Follow-Up." Human Factors, vol. 15, no. 1, 1973, 17–20. doi:10.1177/001872087301500103.
- Coleman, M., Liau, T. L., "A Computer Readability Formula Designed for Machine Scoring." Journal of Applied Psychology, vol. 60, no. 2, 1975, 283–284. doi:10.1037/h0076540.
- Kincaid, J. P. et al., "Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel." Research Branch Report 8-75, Chief of Naval Technical Training, 1975.
- Flesch, R., "A New Readability Yardstick." Journal of Applied Psychology, vol. 32, no. 3, 1948, 221–233. doi:10.1037/h0057532.
- Caylor, J. S. et al., "Methodologies for Determining Reading Requirements of Military Occupational Specialties." Human Resources Research Organization, Alexandria VA, 1973.
- Gunning, R., "Principles of Clear News Writing." The Scripps-Howard Newspapers, 1951, pg. 93. https://catalog.hathitrust.org/Record/006673379
- O’Hayre, J., U.S. Dept. of the Interior, Bureau of Land Management. "Gobbledygook Has Gotta Go." U.S. Government Printing Office, 1966, pg. 8. https://catalog.hathitrust.org/Record/001463994
- Anderson, J., "Lix and Rix: Variations on a Little-Known Readability Index." Journal of Reading, vol. 26, no. 6, 1983, pg. 490.
- McLaughlin, G. H., "SMOG Grading-a New Readability Formula." Journal of Reading, vol. 12, no. 8, 1969, 639–646.

Related Resources

License Information

This work is licensed under a Creative Commons Attribution 4.0 International License

Wolfram Function Repository

ReadabilityScore

Details and Options