Sex, death and sonnets -- musings of a software developer

Sigfrid Lundberg

This note discusses how software can recognize sonnets by analysing text length, strophe structure and number of syllables per line. It also performs a simple content analysis based on word frequencies.

The results clearly show that simple Unix™ for Poets analyses combine seamlessly with TEI markup and XML technologies.

For the most comfortable reading, download the PDF.

How to work with these problems

I doubt that you want to work with these problems in the way I do. Still, you might want to check things, or perhaps even develop things further. Note that you must clone two git repositories in order to do that.

git clone git@github.com:siglun/danish-sonnets.git
git clone git@github.com:kb-dk/public-adl-text-sources.git

To rerun my analyses you must work on a Unix or Linux system and have Java and Perl installed, together with both the xsltproc and Saxon XSLT processors. There are also shell scripts and a Makefile, and I do some processing using xmllint.

The documents are authored in TEI XML, and you need xsltproc and GNU groff to format them.
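Before running anything, it may help to check that the tools mentioned above are on your PATH. The following is a minimal sketch, not part of the repository: the function name `check_tools` is hypothetical, and the binary names (`groff` for GNU groff, `make` for the Makefile) are assumptions about a typical installation. Saxon is a jar file rather than a command, so it is not covered here.

```shell
#!/bin/sh
# Sketch: report which of the named command-line tools are missing from PATH.
# check_tools is a hypothetical helper, not something shipped with the repository.
check_tools() {
    missing=""
    for tool in "$@"; do
        # command -v is the POSIX way to ask whether a command can be found
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    # Print the missing tools separated by spaces (an empty line if none)
    echo "${missing# }"
}

# Tool names assumed from the prerequisites described in the text
missing_tools=$(check_tools git java perl xsltproc xmllint make groff)
if [ -n "$missing_tools" ]; then
    echo "Missing: $missing_tools"
else
    echo "All required tools found."
fi
```

The script only looks for the commands; it does not check versions.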

There is a parameters.sh which is sourced all over the place. Edit it to set relevant parameters:

SAXON_JAR="/where/your/Saxon-HE-9.9. or better can be found .jar"
SAXON="java -jar $SAXON_JAR "

PROJECTS="$HOME/projects"

HERE="$PROJECTS/danish-sonnets"
THERE="$PROJECTS/public-adl-text-sources"
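Since the other scripts source parameters.sh, a wrong path there tends to show up as a confusing error much later. As a hedged sketch, a small check like the one below could be run after editing the file. The function name `check_parameters` is hypothetical and not part of the repository; it only assumes the three variables shown above.

```shell
#!/bin/sh
# Sketch: verify that the paths set in parameters.sh actually exist.
# check_parameters is a hypothetical helper, not shipped with the repository.
# Arguments: $1 = SAXON_JAR, $2 = HERE, $3 = THERE
check_parameters() {
    rc=0
    [ -f "$1" ] || { echo "SAXON_JAR is not a file: $1"; rc=1; }
    [ -d "$2" ] || { echo "HERE is not a directory: $2"; rc=1; }
    [ -d "$3" ] || { echo "THERE is not a directory: $3"; rc=1; }
    return $rc
}

# Typical use, assuming parameters.sh is in the current directory:
#   . ./parameters.sh
#   check_parameters "$SAXON_JAR" "$HERE" "$THERE" || exit 1
```

The function returns a non-zero status if any of the three paths is missing, and prints one line per problem so that all of them can be fixed in one go.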

