Monday, January 5, 2026

Bigram Analysis: Neville's Letters vs. Shakespeare

I created an XML file of Neville's letters from Winwood's Memorials of Affairs of State, Vol 1 and 2. That's 89 letters Henry Neville wrote, mostly as ambassador from 1599-1601.

Using Pervez Rizvi's database of early modern English plays, I did a comparison of bigrams (two word combos) to see which plays more closely match the letters. I evaluated 239 plays from 1590-1615. The results are stunning. Shakespeare's plays rank at the top consistently:

Rank Year Similarity Title
116130.6126Henry VIII [Full Play]
216090.6079The Winter's Tale
315990.5897Henry V
416130.5866Henry VIII [Shakespeare Section]
516100.5843Cymbeline
616030.5736All's Well That Ends Well
716000.5687Cynthia's Revels (Jonson)
815970.5659Henry IV, Part 2
916080.5652Coriolanus
1016020.5645The Royal King and the Loyal Subject
1116070.5597The Tragedy of Charles Duke of Byron
1215950.5585Love's Labor's Lost
1316030.5583Measure for Measure
1415990.55281 Edward the Fourth
1515990.5509Every Man Out of His Humour (Jonson)
1615970.5473Henry IV, Part 1
1716050.5448Philotas
1816010.5431Hamlet
1915990.54202 Edward the Fourth
2016090.5414Epicoene (Jonson)
2115910.5404Henry VI, Part 2
2216070.5398The Conspiracy of Charles Duke of Byron
2316050.5392King Lear
2416140.5381The Hector of Germany
2516140.5356Bartholomew Fair (Jonson)
2616040.5333Sejanus His Fall (Jonson)
2716130.5326Henry VIII [Fletcher Section]
2816060.5324The Isle of Gulls
2916040.5310The Widow's Tears
3016050.5301Volpone (Jonson)
3116110.5299Catiline His Conspiracy (Jonson)
3216060.5289Antony and Cleopatra
3316100.5285The Revenge of Bussy D'Ambois
3416140.5280The Staple of News
3515980.5240Every Man in His Humour (Jonson)
3615920.5216A Knack to Know a Knave
3715900.5210Jack Straw
3815960.5205The Merchant of Venice
3916040.5187When You See Me You Know Me
4015950.5173Richard II

This is not conclusive evidence that Henry Neville wrote the works of Shakespeare. But it is an objective and reproducible test that shows a clear affinity between the two-word phrases Henry Neville and Shakespeare used. 

This overlap is partly due to the topic of the plays aligning with the experiences Neville had as ambassador. This is not a defect in this study. Quite the opposite, the overlap is another piece of strong evidence.

This research was done with the help of Claude Code. 

I ran a similar test, with the help of ChatGPT Codex, that reduces reliance on topical words.  "The new test uses function‑word bigrams only (top 200 MFW), then compares length‑matched windows with bootstrapping and reports mean ± std. This reduces topical bias and makes comparisons fairer across different text lengths." Very similar results:

RankYear  TitleMean_Sim
11613Henry VIII [Shakespeare Sect]0.7802
21613Henry VIII0.7548
31599Henry V0.7533
41607Tragedy of Charles Duke of Byron0.7435
51607Conspiracy of Charles Duke of Byron0.7386
61609The Winter's Tale0.7342
71605Philotas0.7341
81606Macbeth0.7309
91614The Hector of Germany0.7268
101595Richard II0.7219
111604Sejanus His Fall0.7207
121610Cymbeline0.7202
131608Coriolanus0.7191
141597Henry IV, Part 20.7165
151590The Reign of King Edward the Third0.7160
161591Locrine0.7147
171606The Rape of Lucrece0.7130
181592Summer's Last Will and Testament0.7129
191610The Revenge of Bussy D'Ambois0.7117
201603The Family of Love0.7116
211596King John0.7108
221592Henry VI, Part 10.7106
231606Hymenaei0.7103
241603All's Well That Ends Well0.7094
251595Love's Labor's Lost0.7091
261611The Atheist's Tragedy0.7089
2715911 The Troublesome Reign of King John0.7055
281613Henry VIII [Fletcher Section]0.7018
291604The Widow's Tears0.7017
301590The Love of David and Fair Bathsheba0.7012
311593The Massacre at Paris0.7005
321611Catiline His Conspiracy0.6993
331591Henry VI, Part 20.6987
3415912 The Troublesome Reign of King John0.6970
351610The Golden Age0.6965
361606The Isle of Gulls0.6942
371614The Staple of News0.6939
381603Measure for Measure0.6935
391606Antony and Cleopatra0.6930
401590Jack Straw0.6928

I have placed all the necessary information to reproduce both tests here: https://nevilleresearch.com/bigram/