Post-Proofread
From Open Siddur Project Development Wiki
The following is a list of changes that should be made to a transcribed text in a pre-encoding stage after proofreading is complete. These changes are preferentially done by a computer program.
Jobs for a computer
- Regularize forms of God's name from source-specific forms to the project-uniform yod-sheva?-heh-vav-[qamats|hiriq]-heh . If a different form was used in the original text, it should be encoded under a tei:choice/tei:orig,tei:reg containing the original and regularized forms, respectively.
- Correct all incorrect uses of the holam haser. vowel-vav-holam haser (pronounced "vo") should become vowel-vav-holam haser for vav . The normal holam haser mark should only be used with vav when it is intended to make a holam male vowel letter (pronounced "o").
- Correct the ktiv form of Yerushalayim to follow proper Unicode conventions: it should read yod-sheva-resh-vav-dagesh-shin-shin dot-qamats-lamed-[patah|qamats]-meteg-CGJ-hiriq-mem sofit
- Correct the dash character to the maqaf character in connected words.
Jobs for a human
- Split the text into logical segments so a computer could pre-encode the text repository