Timothy Wynn

Bug report: Remove prettified code transformation for epub downloads


4 posts in this topic

I'm not exactly sure where this bug report should go, or whether the forums are the right place for this sort of thing. Whichever processor Bookshare uses to generate its epub downloads, it would be a great help if the prettifying transformations were disabled so that some problematic books render properly.

Because I purchased this book from a commercial ebook source, and the book in question was publisher quality, I wanted to compare that epub with the one from Bookshare. The Bookshare version had a lot of truncated words when I read it, while the same book from the commercial source had none. Opening both in a text editor, the only major difference is that in the Bookshare version every HTML tag has been prettified, i.e., each tag is on its own line with indentation. If I remove the prettification manually, so that the only line breaks in the source are between paragraphs, the epub renders identically to the commercial source.

The reason for the truncation is that almost all renderers treat a line break in the source as a space in the output. This is a problem when the formatting changes mid-word, e.g., if the letter "o" in "word" is bolded. Because each nested tag is on its own line, spaces are inserted into the output, so that it reads "w o rd", for example. This makes for an irritating reading experience, and it is easily solved by leaving the original epub source untouched, i.e., not prettifying it.
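
To illustrate the effect described above, here is a minimal Python sketch. It is not Bookshare's pipeline or any real reading system. It only approximates what a renderer does by joining the text nodes and collapsing each run of whitespace (including line breaks) into a single space. The names `TextExtractor` and `rendered_text` and the two sample strings are made up for this example.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of a markup fragment, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def rendered_text(markup: str) -> str:
    """Rough approximation of rendered text: join the text nodes,
    then collapse every run of whitespace into one space."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    return re.sub(r"\s+", " ", "".join(parser.chunks)).strip()


# Hypothetical sample: the letter "o" in "word" is bolded.
compact = "<p>w<b>o</b>rd</p>"
# The same markup after prettifying: every tag on its own indented line.
prettified = "<p>\n  w\n  <b>\n    o\n  </b>\n  rd\n</p>"

print(rendered_text(compact))     # word
print(rendered_text(prettified))  # w o rd
```

The only difference between the two inputs is the whitespace added around the inline tags, which is enough to split the word in the output.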

I am not sure how much control Bookshare has over its epub generation pipeline, but I figured this bug report was worth bringing up, if for nothing else than to raise awareness of the issue.


Hi Timothy,

Thanks so much for sharing this. To directly alert our Collection Development team, which handles all book scanning and collection maintenance, to issues like this, you can always submit a Book Quality Report by following the steps below:

1. Log in to your Bookshare account at: https://www.bookshare.org/
2. Locate the book.
3. Select the Book's Title.
4. Select the Report Book Quality Issue link.
If you choose the option to be notified, you will receive an email from the Collection Development Department when an update is available.

Learn more about these reports here: https://www.bookshare.org/cms/help-center/report-book-quality-issues


Hello Heather,
Thank you for letting me know. I have filed the report, though the problem is not specific to just one book. It is an actual pipeline issue which *can* affect other books as well if style or format changes occur mid-word. Do I need to delete this topic so as not to clutter this forum? Thank you once again for quickly responding with the proper place to report this kind of thing.
