@sts10 sts10 commented Jan 16, 2023

Thought I'd take a stab at creating a new 46,656-word list for use in this project. (In a way this works toward the goals of #4.)

The words come from my own scrape of 2012 Google Ngram data and from the Niceware v4.0 word list. I built the list with Tidy, a tool I wrote.

Some attributes describing the new word list:

List length               : 46656 words
Mean word length          : 8.04 characters
Length of shortest word   : 4 characters (able)
Length of longest word    : 12 characters (youthfulness)
Entropy per word          : 15.510 bits
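
For anyone who wants to check the entropy figure, here is a minimal sketch of the arithmetic. It assumes each word is picked uniformly at random from the list; the function name is only illustrative and is not taken from Tidy or from this project.

```python
import math

LIST_LENGTH = 46_656  # list length reported above; equal to 6**6

def entropy_per_word(list_length: int) -> float:
    """Bits of entropy from one uniformly random pick from a list of this size."""
    return math.log2(list_length)

bits = entropy_per_word(LIST_LENGTH)
print(f"{bits:.3f} bits per word")           # 15.510 bits per word
print(f"{6 * bits:.2f} bits for six words")  # 93.06 bits for six words
```

Because 46,656 is exactly 6^6, every sequence of six die rolls maps to exactly one word, which is why this length suits a 6-dice list.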

And here are some generated passphrases from the list:

surgeons revivalist whether waders impulsively flippant 
humanely gallon compile integers aught outpouring 
revered enfranchise stalactites spooking daintier wreak 
cribbed slaved gratitude narrowed collection materialists 
fibula diameter nutritional furniture gullies surmise 
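
Passphrases like the ones above can be produced by treating six die rolls as a base-6 index into the list. The sketch below shows one way to do that; it is not necessarily how this project or Tidy does it. The function names are hypothetical, `words` is assumed to be the 46,656-entry list already loaded into memory, and the rolls are simulated with Python's `secrets` module rather than physical dice.

```python
import secrets

DICE_PER_WORD = 6  # six rolls give 6**6 = 46,656 possible outcomes

def rolls_to_index(rolls):
    """Map die rolls (each 1-6) to a zero-based list index, read as base-6 digits."""
    index = 0
    for roll in rolls:
        if not 1 <= roll <= 6:
            raise ValueError(f"invalid die roll: {roll}")
        index = index * 6 + (roll - 1)
    return index

def roll_dice(count=DICE_PER_WORD):
    """Simulate fair die rolls with a cryptographically secure RNG."""
    return [secrets.randbelow(6) + 1 for _ in range(count)]

def generate_passphrase(words, num_words=6):
    """Pick num_words words, each chosen by a fresh set of six die rolls."""
    if len(words) != 6 ** DICE_PER_WORD:
        raise ValueError("expected a list of exactly 46,656 words")
    return " ".join(words[rolls_to_index(roll_dice())] for _ in range(num_words))
```

With this mapping, rolls of 1-1-1-1-1-1 select the first word in the list and 6-6-6-6-6-6 select the last.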

Note that, to include a wider range of commonly used words, I've allowed words as short as 4 characters and as long as 12 characters. I realize this may disqualify the list outright, but I thought I'd submit the PR anyway.

@dmuth dmuth self-assigned this Jan 16, 2023
Owner

dmuth commented Jan 16, 2023

Thanks! I see the mean word length is 8 characters, so that looks pretty reasonable.

I have some travel coming up which will go through next weekend, so I expect to be able to look through this and try implementing it in a couple of weeks.

-- Doug

…d Niceware list to make a new wordlist-6-dice list