What is PG-TEI and why is it being developed?
Joshua Hutchinson on Apr 2nd 2007
PGTEI is first and foremost a markup language. Just like HTML puts tags around words to give them some sort of meaning, PGTEI puts tags around words, too. Where a HTML document would italicize a word with <em>emphasis tags</em>, PGTEI would italicize a word with <emph>emphasis tags</emph>, too. Same basic principle, different vocubulary.
So, why not just stick with HTML? A lot more people know HTML and a lot more tools exist to help people use it. And everyone has a web browser that is custom made to read the resultant file, right?
Well, PGTEI solves some problems that may not be immediately obvious to someone coming at it from an end-user/consumer role. Where PGTEI helps the most is during creation and distribution.
The Pain That Is Content Creation For Project Gutenberg
When Michael Hart first started Project Gutenberg, there was one file type: plain ascii text. (In fact, the original files were all upper case plain ascii text, which were converted years later to the normal upper/lower case documents we’re probably all familiar with.) It wasn’t all that long before someone realized that while plain ascii text is great for English language documents, it doesn’t handle letters outside the ASCII character set. So, we started seeing documents in different ISO character sets. In more recent times, UTF-8 has gotten popular as a character encoding since, theoretically, it’ll handle every character you could ever want to throw at it.
Filed in eBook Creation Help | No responses yet