e0: EPUB Zero: a radically simple(r) e-book format

Thursday, February 14, 2013

EPUB Zero: a radically simple(r) e-book format

Earlier this week, I participated in the W3C Workshop on Electronic Books. A common theme was the complexity of the EPUB3 specification, how difficult it was to implement, and how few implementations exist.

These ideas were expressed most forcefully by Daniel Glazman (slides available here as PDF). I'd been familiar with some of his thoughts, as he'd posted extensively about his experience with EPUB3 as he implemented an EPUB3 editor. His rant about the absurd number of navigation files particularly resonated with me. Why do we need a manifest, a spine, an NCX, a nav document, landmarks, and guides?

What would an e-book look like, if we tried to avoid as much complexity as possible? The idea wouldn't go away. I try to avoid abstract thinking, and so my natural reaction was to build a sample book and see what happened. So I'm in the middle of that process, with Moby-Dick, of course. Let me know if you want me to email you a copy.

PRINCIPLES

The goal is for an e-book to be as simple as possible, and as close to the web as possible. Is it possible to make an e-book without any e-book-specific features? Do we need anything beyond bog-standard HTML5, CSS, JavaScript, SVG, MathML, and media? I'd like to find out.

Another goal is to make authoring easier. I wonder how much of the complexity of previous e-book specifications was to make life easier for reading system developers, who have course been the major participants in the standards bodies (not that there's anything wrong with that!).

INSIDE EPUB ZERO

The Container

An EPUB Zero file is a zipped folder containing only content files. It's identified by the file extension. There is no mimetype, no META-INF, no container.xml. And so the zip process is much simpler, your operating system's zip command should work without changes. None of this

zip -v0X $FOLDER mimetype
zip -vr $FOLDER * -x $FOLDER mimetype

bullshit.

The Package

The heart of an EPUB Zero is the index.html file. The reading system (if they were ever to exist) would look inside the zip for index.html. This file provides navigation (via the nav element), defines the order of content documents (via the nav element!), and contains document metadata (see below). Not all content documents need to be in the nav element; if they're in the zip you can reference them via links and have an "out of spine" item. Same goes for images, as well as audio, and video (perish the thought).

An open question is how to define what happens when the book first opens. You might not want to see a complex table of contents as soon as you open a book. Perhaps if the nav element is hidden, the reading system would then just open the first document referenced by nav.

Metadata

I'm unsure how to handle metadata. My first thought was to use the head of index.html:

<meta charset='utf-8'> 
    <title>Moby-Dick</title>
    <meta name="dcterms.creator" content="Herman Melville"/>
    <meta name="dcterms.title" content="Moby-Dick"/>
    <meta name="dcterms.identifier" content="x9780000000000"/>
    <meta name="dcterms.language" content="en"/>
    <meta name="dcterms.modified" content="2013-02-14"/>
    <meta name="dcterms.publisher" content="Harper & Sons"/>

That strikes me as inadequate. Would this be enough for the very simplest cases, in conjunction with some sort of link to an ONIX record? The middle ground here seems like dangerous ground, as we always want to handle "one more thing…"

Content Documents

All content documents are HTML5, which of course can contain SVG and MathML. I reserve the right to use the XML serialization for any EPUB Zero I produce :)

WHAT IS AN EBOOK?

When you go to a website, you navigate through the content by clicking on links, going from page to page. What makes an e-book different is that the sequence of pages is defined ahead of time, and the reading system helps the reader navigate. Does this mean that a plain web browser won't work as a (packaged) e-book reader, without some sort of extension or scripting?

WHAT NEXT?

I haven't thought about how accessibility would work in this context, or digital rights management (could a digital signature work with this file structure?), or many other things. Maybe the ultimate answer is that we need the complexity of the existing specs. But I'd prefer to be convinced that this is too simple, rather than assume that what we have is just right.

Dave (writing, of course, as a private citizen)

21 comments:

joefrizzellFebruary 15, 2013 at 4:57 AM
Interesting concept. I'm a big fan of the KISS ideology. I think one of the reasons the current EPUB standard is so sprawling is to keep the new players in line. The industry is beginning to accept the idea that EPUBs are just websites. Maybe with that acceptance there will be a return to existing web standards, rather than trying to reinvent the wheel.
ReplyDelete
Replies
UnknownFebruary 15, 2013 at 6:55 AM
with this model looks very promising. There could be some amount of implied packege-ness if there is content document level metadata placed in the head. (This CD is a component of EPUB package with pointers to package identifier and CD identifiers). This allows each content doc to both stand alone and be part of package.
ReplyDelete
Replies
UnknownFebruary 15, 2013 at 9:35 AM
Very good idea and something that I have been experimenting with myself.
I think the best approach is a classic graceful degradation - if you open ePUB Zero in a browser it should just work.
If you open it in a browser with Javascript support, perhaps all the reader functionality can be overlaid by rearranging the DOM a little bit?
ReplyDelete
Replies
AnonymousFebruary 15, 2013 at 3:24 PM
dave, this is all you should need for moby dick:
> http://www.gutenberg.org/cache/epub/2701/pg2701.txt

-bowerbird
ReplyDelete
Replies
UnknownFebruary 15, 2013 at 7:11 PM
I was also attending the IDPF/W3C workshop and I'm glad that you created this blog to discuss things.

Daniel raised many interesting points, but most of them are due to a single thing: compatibility. We kept many things in EPUB 3.0 for compatibility reasons (too many to list), had to settle for less than ideal solutions for other things (reference to WD for many specs, support for JS) and took some bad decisions too (metadata).

Navigation/Package

You mention that the navigation file provides both the navigation (NCX in EPUB 2, navigation document in EPUB 3) and the order of the documents (spine in OPF in both EPUB 2 & 3).

I'm perfectly fine with that but it's not clear if you mean that both the navigation and the order should be available in the <nav> element or if you consider that the navigation document is good enough for both.

While I agree that NCX and guide should be deprecated (which they are in EPUB 3) and I doubt the real usefulness of the manifest, I still see a clear benefit in separating the spine from the table of contents.

A spine defines the order of the documents while navigation provides a set of links to various resources, including specific fragments of a document.

Instead of limiting both the spine and navigation to HTML5 documents, I believe that we should authorize HTML5 + all core media types for images/audio/video to appear in both elements.

That said, we don't need a separate OPF element just for the spine, a separate list in the same navigation document would work fine.

Metadata

I'm not a fan of your proposal for metadata, repeating the meta element all the time is every bit as ugly as what we have in EPUB 3.
I liked how things worked in EPUB 2.0 better: a parent metadata element where all the child elements are the metadata.

To avoid the messiness of an ID/IDref system, we'd use additional child elements to refine things.

As for external resources, we should rely on the link element with the proper mediatypes and link relationships.

The current suggested list of rel values in the EPUB 3.0 specification reflects a poor understanding of how links work.
"marc21xml-record", "mods-record", "onix-record" and "xmp-record" should all be replaced by a generic "record" relationship along with the proper media type (which would make it extensible to anything else that can be identified with a media type, such as OPDS for example).

The EPUB specification should also mention the link registry that was officially created with RFC5988, and use URLs as the proper extension mechanism.

Finally, and this might be a little controversial, I believe that any metadata that we include in the package (which would be the navigation document too in EPUB Zero), can also be used the same way in any content document to define document-level metadata.

What happens when you open such a file

Displaying the navigation document when you first open such a file sounds like a bad idea: that's not what the user would expect to see.

Opening the first document sounds like a much better idea. That said, one of the thing that I really hate right now in EPUB 2.0/3.0 is that many publishers use the cover as a non-linear element that you never see when you open a book, or can even reach using navigation.
For some books (comics for example), the cover is every bit as part of the experience as the rest of the book. I'd like to see a mechanism that at least let the user decide if they'd like to see the cover or not when they open a book.
ReplyDelete
Replies
Bill McCoyFebruary 16, 2013 at 8:03 AM
Dave, your thought experiment sounds a lot like existing alternatives HPub and Zhook. These haven't particularly taken off which is to me a relevant datapoint.

Of course the main enemy of simplicity is backwards compatibility. But we don't need a new format for this, we just need to agree we don't need the benefits of that backwards compatibility and then stop using the parts that are only there for that backwards compatibility, and et voila, we have a simpler EPUB. For example if you want to avoid non-web vocabularies then with EPUB 3 we can use just the nav document which is already HTML5 and forget about NCX. Ditto Guides. We don't have to step back and ask "could these things be expressed in HTML5?" - we know they can, the work to figure it out is already done. We could make some of the redundant manifest data structures optional... no need for manifest and spine and nav if all the information is in the nav and nothing extra is provide by the rest. This was even discussed in EPUB 3 WG but again backwards compatibility led to a decision not to go there.

The big one is allowing HTML, i.e. not requiring XHTML. Since EPUB 3 reading systems will be built on browser engines there is no reason in principle we couldn't relax this constraint and immediately achieve far more alignment with the rest of the Open Web Platform. But it would have dire consequences for toolchains that want to manipulate EPUB files and since much of publishing industry uses XML heavily - and we need most of all more tools for which reliable formats are good not bad - this one is tricky for me. But this is not about simplicity - it is more complicated to have two serializations of content, HTML and XHTML, not simpler. I can't believe for example that Daniel Glazman would think that his job of writing Blue Griffon would have been easier if the content wasn't XHTML..
ReplyDelete
Replies
RichardIGPFebruary 16, 2013 at 9:55 PM
Dave, the thought experiment is great and worth pushing along. The full ePub3 spec makes it potentially difficult and expensive to produce content and the minimal core is hard to find. The e-retailer reading systems are all so different and incomplete they take hours of laborious testing. Daniel's objections to ePub3 are pretty spot on. I think he stole my list! Rebooting the format is needed.

So the concept of ePub0 is core packaging corrections and the biggest challenge seems to be start-up and nav.

The index.html page is the opening page and the nav item may be hidden. That allows the index page to have a cover image, title page text, promo or anything else. The reading system must read the nav item to know where to go to next. The index nav is a variable-spine and TOC.

With this approach the index-nav list could contain the full section sequence for linear material or just have just one link to a start page where books use extensive internal linking. We use this heavily with text books. The spine is a big list of linear no's and the TOC links only to units.

An ePub is the wrong place to manipulate content and that should never be a reason for making the format more complex. However is there any reason why ePub Zero can't optionally be XHTML5 or does that remove the concept zero?

I would like to take a shot at creating an ePub0 for more navigationally complex material. It sounds like a dream come true for sophisticated presentation material that can move seamlessly from online to a secure package.

Not trying to force things but the specification at present is something like:

1. An ePub zero must be zip package of HTML5/XHTML5 with no errors (pass HTML5 validation tests)
2. The opening page must be named index.html
3. The index.html page will be displayed and must have a nav element which may be hidden
4. The index.html page can be listed as an item in the nav element. It can appear in any position in the nav.
5. The nav list defines the next-previous default navigation as well as the reading system presented TOC.
5. Document metadata is the meta statements in the index file as dc terms (can we say there must at least be a title, identifier and date).
6. The loading rule for the reading system is simply: If the epub zip package contains an index.html, open the index file, read the nav list. Wait for the user.

I know this has started as a thought thing but this would be very straight-forward to incorporate into AZARDI as we already have an HTML5 packaging and reading mode. It just means the reading system has to look for *.opf or index.*

Once the spec list is finalized we will include it in AZARDI Desktop. If you can get us the final rule set we can have it ready in AZARDI 19 Desktop due out in around four weeks.
ReplyDelete
Replies
Peter HatchFebruary 17, 2013 at 7:58 PM
So it seems like this could be even simpler, and I'm wondering if it'd be practical.

Why not have the entire book (except non-linear documents) be in the index.html? No need for a spine element then. Seems like it'd be simpler, easier to author, and more like the web. Would this have too much of a negative impact on performance?
ReplyDelete
Replies
bowerbirdFebruary 24, 2013 at 1:42 PM
dave said:
> But that caused problems for reading systems,
> which couldn't digest a 5MB html file all at once.

actually, the limit is 300k. not quite the same thing.
plus i doubt you actually have many 5mb .html files.

but it's plain stupid to let inferior coders hold us back.

> Today, I think that having a single content file
> would be very limiting.

you'll need to justify that statement better than you have.

-bowerbird
ReplyDelete
Replies
health shieldMarch 30, 2013 at 9:28 AM
finally, i'm found this tutorial. great !

visit my site http://sejutabuku.com
ReplyDelete
Replies
UnknownJuly 21, 2015 at 2:06 AM
Great blog nice n useful information , it is very helpful for me.

ePub3 Services in UK
ReplyDelete
Replies
UnknownOctober 24, 2018 at 4:15 AM
Thanks for your marvelous posting......Our Epub3 services are suitable for migration to the latest electronic publishing platform based on HTML5 and CSS3. Considered to be error free and advanced, the Epub3 conversion makes the Ebooks, a class apart in terms of usability and appeal. It is a recognized digital format by IDPF. For quick conversion work, to contacting us today.

Epub3 Services
ReplyDelete
Replies

Add comment