• Self-Publishing
  • Author Blogging
  • Sitemap
  • Fonts/Typography

The Book Designer

Practical advice to help build better books

  • Home
  • About
  • Articles
  • Contact
  • Shop
Home » Manuscript to Ebook: A Cleaning Guide

Manuscript to Ebook: A Cleaning Guide

by Joel Friedlander on September 9, 2015 29 Comments

By David Kudler

Over the last couple of months, I’ve been talking about just what an ebook is, and four basic methods for creating them.

This month, I’m going to get a bit more into the nitty-gritty — how best to prepare your manuscript for conversion.

Whichever of the methods you use to create your ebook, it’s essential to have the original file be as clean as possible.*

What do I mean by that?

Basically, it comes down to one thing:

Keep It Simple with Styles

The most important thing you need to do to your text is to make sure that all of the formatting is simple and consistent. By simple and consistent, I don’t mean minimalist — I just mean that:

  • all of the body paragraphs look the same
  • all of the chapter titles and section heads look the same

Most manuscripts that come to me don’t look that way. Sometimes, they look as if they’ve been fed through a wood chipper.

As we write, we pull text from various sources:

  • quotes from the internet
  • snippets of text that we typed into the phone
  • chapters written in different apps

Or perhaps we play around with how the text looks:

  • sometimes paragraphs begin with an indent
  • sometimes with a tab, sometimes with four spaces
  • sometimes with two returns in a row

Maybe we try out different fonts to see how they look; if a project has taken a long time, our tastes may change and the typeface we used for the body text may have changed, sometimes more than once: Palatino, Times, Times New Roman, Cochin, Helvetica, Comic Sans.

Maybe you found that your eyes were getting tired and so you upped the font size from the standard 12 points to 14; maybe a line was too long and so you decreased it to 11 points. I’ve seen all of this.

Unless you’re looking closely, this can make your book look like a crazy quilt.

Here’s the thing: we don’t want our font changes to be local. We want them to be global — uniform throughout the book. And so we don’t want to tag each paragraph with a typeface, size, and weight (i.e., bold, semi-bold, regular, or light).

If we do that, once it gets imported into the ebook, if we want to change how the text looks, we have to make the change for each individual paragraph — even if most of the paragraphs are the same, if just a few are different, they’ll stand out like the proverbial sore thumb.

Styling with Styles

The way to avoid this is by using the Styles function in whatever app you’re using to get the manuscript ready. †‡

In Word, the Styles tool is part of the pop-up Toolbox:

Styles

See that list on the right? Each one of those items is a style — a global set of choices about what kind of font to use where. If you’ve worked with HTML at all, this is something like a stylesheet.§

  1. The first thing we are going to need to do is to make sure that all of the text is the same typeface and size, with the same margins.

    Yeah, I know: painful. You spent hours getting those Zapfino captions looking just right.

    Sorry.

    Here’s the truth: none (or close to none) of the formatting that you’ve done will translate properly to an ebook.

    Remember: there’s no way to predict what size screen your reader is going to be reading on. And if whatever conversion technique you choose does manage to make the ePub version of the text look more or less like the Word (or Pages or OpenOffice or InDesign or…) document, if you’ve used local formatting, the text will be difficult to style later, and won’t display consistently across different platforms — on a Kindle Fire, say, or an iPhone, or a 27” monitor.**

    So we’re going to strip back the text to the bare studs. The only formatting you’ll want to keep is italics and boldfacing. ††

  2. Before we destroy all of your beautiful fonts, save. Now save the file again under another name — add clean at the end of the filename, or something to let you know which version to use.
  3. Select the whole manuscript (press ctl-A in Windows and command-A on Macs). In the Styles tool, select the Normal style.

    Yeah. Sorry.

    This will have gotten rid of most of the formatting — though not those character style changes I talked about: italics, bold, etc.

  4. Select the whole text again and make sure that it’s all the same font and size — choose a common font like Times or Helvetica so that it will display all of the character styles correctly.
  5. Click on the New Style… button at the top of the Styles tool, and name the style something like Body or Text.
  6. With all of the text still selected, click on the Body style button that has appeared in the Style tool, so that the entire manuscript is formatted as Body paragraphs.

    Why have we done this? Because the huge majority of your book is going to be regular body text. We’re going to add the other styles back in a minute.

    You probably don’t need to worry about the way that the style looks. If you want to play with the side or bottom margins or the first-line indent, go for it‡‡ — but don’t get too fine about typefaces, letter-spacing, line-spacing, etc. We’re probably going to have to set a lot of those after the conversion anyway — and here’s a difficult truth: many of the styles you specify will be overridden by the reader’s user preferences.

    The typeface and size, the background color — unless you know what you’re doing (and sometimes not even then), the font display preferences that a reader chooses will trump many of the design choices you’ve made.

Adding Styles Back In

Now, you’re going to need to go through your whole book and set the style for each paragraph that isn’t a body paragraph. Word has some of those styles built in:

Styles

And so on.

  1. To help you find paragraphs that you want to style, open your old version of the manuscript. (This is one reason that folks like me like to have two monitors.)
     
    You may find that there are styles that aren’t accounted for in Word’s standard style library — perhaps you want extended block-quotes (aka extracts) where the font size is smaller and the side margins wider. Perhaps you have call-out quotes or sidebars. Perhaps you need captions, or your characters exchange letters, or there are text conversations and you want to make them look as if they’re being conducted on an iPhone: **

    Dialogue

  2. Select the paragraph you want to add the style to.

    If it’s something simple (a wider margin and smaller font-size, for example), make the change. Don’t get too typeface-happy! If you use more than a couple of basic typefaces (one family — typically a serif typeface — for the body text and another — typically sans-serif — for the headers, not only will they probably disappear after the conversion, but if they don’t, your book will look like a ransom note.

  3. Click on the New Style… button, name your style, apply the style to the current paragraph (don’t forget this step!), and move on.

    The next time you find a paragraph that needs to fit that style, just select the text and click on the new custom style in the Style tool. Voilà! Your paragraphs are now formatted consistently.

  4. Once you’ve gone through the whole manuscript, go through it again — make sure that all of the block-quotes have the Block-Quote style applied, that you haven’t styled any chapter titles as Heading 2 instead of Heading 1, and that the whole thing looks consistent. (You may also see some errors now that the typeface is different.)

    You know what? If you have the time, it’s okay to go through it a third time. 😉

Squeaky Clean

The last step in cleaning up your manuscript and getting it ready to become a book is to get rid of extraneous white space. We’re going to use styles to create indents and margins.

  1. Use the Find and Replace dialog to change every double space to a single space. If you had more than two spaces in a row, there will still be some there. Do it again and again until Word tells you it can’t find any.
  2. Do the same for double paragraph breaks: enter “^p^p” in the Find dialog and replace them with single breaks (“^p”). If you tended to add indents to your paragraph with the tab or space bar, replace “^p “ and “^p^t” with “^p”
  3. Repeat until clean.

There you go. Your manuscript is now consistent and ready for conversion — congratulations!

We’ll be discussing how to use HTML stylesheets after conversion so that your text displays correctly in coming months.

But next month: images.


*By the way, all of these suggestions apply to preparing your manuscript as a print book. Just saying! It will save you lots of time — or, if you’re hiring a designer, lots of money.
†I’m going to use Microsoft Word as the example, but just about every piece of text editing software, from barebones apps like TextEdit to high-end apps like InDesign has some variation on this tool. If you can’t find it, go to the software developer’s website, find the support area, and do a search on Styles — directions to how to add styles to your document should be there.
‡If you’ve already been using styles consistently, congratulations! You’ll still want to go through and make sure that all of your styles have been applied properly. Skip down to Adding Styles Back In.
§This is not precisely true. HTML styles (CSS) cascade — the style of each paragraph inherits any undefined formatting from the section that it’s in. In Word, the formatting for one Body paragraph is the same as any other Body paragraph.
**And that’s not even mentioning different apps.
††Not usually underlines! Typesetting fact: in printing, an underline simply denotes make this italic. In computer documents and ebooks, it means this is a hyperlink. So consider changing all of your underlined text to italics. If you’re used to marking book titles, etc. with underlines… I’m sorry. But unless you’re using the style for extreme EMPHASIS — and probably not even then — avoid underlines. Use italics instead.
‡‡Just don’t try both a bottom margin (a space after a paragraph) and text indent. They both mean new paragraph. Using both is redundant and a bad look
§§This was all done using HTML styles in the ebook. The left and right blocks are two different style with different margins and background colors. The author had styled the paragraphs so that it made creating the look we wanted relatively easy. (I don’t recommend this particular trick by the way — it’s difficult to get to work consistently!)

David Kudler headshot x 125David Kudler is a Contributing Writer for TheBookDesigner.com. He is also an author, an editor, an ebook designer and a writer for the Huffington Post.

You can learn more about David here.

 
Photo: bigstockphoto.com

Filed Under: Contributing Writers, E-Books & Readers Tagged With: David Kudler, e-books, ebook conversion

journal marketing

Comments

  1. John Rothra says

    May 10, 2018 at 9:04 pm

    This is a great resource and I’ve been following it. However, I’ve run into one problem: numbered lists. I’m using Calibre to convert my docx into MOBI and EPUB. I noticed on my Kindle for PC that the numbered lists are on the far left. In both cases the numbering is within a block quote. I used one of the Styles to create the block quote, and the created a copy that would include lists. One of the lists is an outline (I created the subpoints by hitting the “Indent” feature, not the Tab key). To help it be clear, here are screen shots:

    MS Word Outline using Style: https://www.johnrothra.com/wp-content/uploads/2018/05/1X-Word.jpg
    Kindle results: https://www.johnrothra.com/wp-content/uploads/2018/05/1X-Kindle.jpg

    However, a non-list block quote converted fine: https://www.johnrothra.com/wp-content/uploads/2018/05/1X-Kindle2.jpg

    What do I do?

    Reply
  2. Dean Johnston says

    January 21, 2016 at 5:33 pm

    Hiya!

    Maybe I’m stupid (it has been noted), but I cannot seem to figure out where the “next time I’ll…” stuff lives. Or, even where this article belongs / can be found on the site.

    Thanks.

    Dean

    Reply
  3. Katrine Geneau says

    October 12, 2015 at 12:39 pm

    Joel, I recently purchased one of your templates – Spark – and am wondering if I need to “strip down” my manuscript first as shown above?

    Thank you,
    Katrine

    Reply
    • David Kudler says

      October 12, 2015 at 4:52 pm

      I don’t speak for Joel here — he’s the maestro.

      But I’m going to guess that in order for your manuscript to import cleanly into one of the templates (whether for Word or for InDesign) that yes, you’re going to need to clean up any peculiarities that have crept in, or they’ll be imported into the template, and you won’t get the layout that you wanted.

      If you’re using a Word template, of course, you could also simply apply the template and then go through chapter by chapter and paragraph by paragraph and apply the proper style.

      Reply
    • Tracy Atkins says

      October 19, 2015 at 1:07 pm

      Hey Katrine,

      The templates are made you can import just about any text. Once in the template itself, you can apply the built in styles and that will do the bulk of the cleanup. We do advise removing things like extra enters that may not be needed (like having 2 between a paragraph.) and similar. Our formatting guide that comes with the templates is a great place to start.

      Reply
  4. Gail Lenhard says

    October 12, 2015 at 6:40 am

    Thanks David! This article is great. May I share it with my Critique Group?
    🙂

    Reply
    • David Kudler says

      October 12, 2015 at 4:41 pm

      Of course, Gail! Please do. And let them know where you found it. 🙂

      Reply
  5. Katrine Geneau says

    September 14, 2015 at 5:50 pm

    This has been a very useful article. Thank you so much!!!

    Reply
    • David Kudler says

      October 12, 2015 at 4:41 pm

      Sorry that I didn’t see this earlier — thanks, Katrine! Glad it was helpful.

      Reply
  6. John Blommers says

    September 9, 2015 at 4:55 pm

    All of this trouble is because writers still use word processors for writing. I recommend and use a markdown editor and completely avoid all of the problems this article addresses. Plus I can easily export to HTML, PDF, ePub, even open document with the editor.

    Reply
    • Joel Friedlander says

      September 11, 2015 at 2:17 pm

      John, I’m curious which markdown editor you use?

      Reply
      • John Blommers says

        September 11, 2015 at 4:19 pm

        Joel, There must be a dozen markdown editors in my Applications folder, some free/open source and some paid for. HarooPad is one that gets used the most. There is also Atom, Mou, Hemingway, GitBook Editor, LightPaper, MultiMarkdown Composer, Marked2, Markdown Plus, and MacDown.

        Also in the tool chest sit Calibre, Sigil, the command line tool Pandoc, and of course the venerable Scrivener.

        Reply
        • David Kudler says

          September 13, 2015 at 10:29 am

          Since designing and editing ebooks is a large part of how I spend each day, Sigil has ended up being my de facto text editor. I can view the file as HTML or WYSIWYG (“what you see is what you get”) — or I can open the Preview panel and view code and formatted text at the same time.

          There are some downsides — no .doc export, no automated footnote/endnote function, and no autosave, but since I have it open nearly all of the time, yeah: Sigil works just fine.

          (It’s also extensible — there’s a part of me that wants to write an endnote plug-in… if I can dust off my Python programming skills. 🙂 )

          Reply
        • Joel Friedlander says

          September 13, 2015 at 11:18 am

          Thanks, John, much appreciated.

          Reply
    • David Kudler says

      September 13, 2015 at 10:24 am

      John,

      True. To be honest, since so much of my own writing at this point is aimed at HTML (either in ebooks or the web), I often write in an HTML editor.

      However, the vast majority of authors continue to work in a WYSIWYG text editor like Word, Pages, or OpenOffice — they don’t want to see the code, they want to see the formatted text. And so nearly all of the manuscripts submitted to me come in .DOC or .RTF files. And so that was the reason that I shared these (admittedly exhaustive) suggestions.

      Reply
      • laura says

        October 19, 2015 at 9:46 pm

        Thanks…I wrote a book 25 years ago and my copyright just ran out. I want to resubmit it and need to format it.:-P

        Reply
  7. Terry Gibson says

    September 9, 2015 at 3:17 pm

    Thanks for the tips, David. Extremely useful.

    Reply
    • David Kudler says

      September 13, 2015 at 10:25 am

      You’re very welcome, Terry!

      Reply

Trackbacks

  1. From The Book Designer: Manuscript to Ebook: A Cleaning Guide – JDs Writers Blog says:
    September 15, 2020 at 1:16 pm

    […] Manuscript to Ebook: A Cleaning Guide […]

    Reply
  2. InDesign Cheat Sheet! Add Some Text! | Just Can't Help Writing says:
    June 17, 2016 at 10:47 am

    […] Here’s help with Word’s Styles process. As one of the footnotes tells us, this is the way to “clean” text for print as well as for ebooks. […]

    Reply
  3. Preparing Images for Your e-Book by David Kudler — The Book Designer says:
    April 1, 2016 at 1:00 pm

    […] month I discussed how to clean up your manuscript to prepare it for ebook conversion. This time I’m going to be looking at how to do the same thing with images. […]

    Reply
  4. Manuscript to Ebook: A Cleaning Guide by David Kudler — The Book Designer | Toni Kennedy : A Writing Life says:
    February 9, 2016 at 7:45 pm

    […] Sourced through Scoop.it from: http://www.thebookdesigner.com […]

    Reply
  5. Manuscript to Ebook: A Cleaning Guide by David ... says:
    February 8, 2016 at 5:06 pm

    […] “ Manuscript to Ebook: A Cleaning Guide by David Kudler explains how best to prepare your manuscript for conversion and why it is essential to have the original file be as clean as possible.”  […]

    Reply
  6. Formatting Service for Self-Publishing Ebooks and Print on Demand says:
    January 12, 2016 at 3:39 pm

    […] first step is to clean up your manuscript in Word, simplifying the text styles in preparation for ebook […]

    Reply
  7. Want to create your own e-book from Word, etc.? Here’s a great overview of conversion software options | TeleRead says:
    November 21, 2015 at 6:54 am

    […] Kudler on preparing your manuscript and your images for the e-book-to-be, as well as 4 Ways to Create an ePub eBook. Also see The […]

    Reply
  8. Writing RoundUp 11/01/2015 | Killer Ink Press says:
    November 1, 2015 at 5:39 am

    […] Manuscript to Ebook: A Cleaning Guide by David Kudler […]

    Reply
  9. Preparing Images for Your e-Book says:
    October 16, 2015 at 12:27 am

    […] month I discussed how to clean up your manuscript to prepare it for ebook conversion. This time I’m going to be looking at how to do the same thing with images. […]

    Reply
  10. Top Picks Thursday 09-17-2015 | The Author Chronicles says:
    September 17, 2015 at 10:01 am

    […] lot of people are self-publishing now. If you are a do-it-yourselfer, David Kudler explains how to clean your manuscript for e-book formatting, and Emma Barnes discusses the idea of a publisher’s technical debt. If you want some help, Jane […]

    Reply
  11. Manuscript to Ebook: A Cleaning Guide by David ... says:
    September 14, 2015 at 12:57 pm

    […] Manuscript to Ebook: A Cleaning Guide by David Kudler explains how best to prepare your manuscript for conversion and why it is essential to have the original file be as clean as possible.  […]

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Join Me & Subscribe!

  • Facebook
  • Instagram
  • LinkedIn
  • Pinterest
  • Twitter
  • YouTube

Topics

  • Audiobooks
  • Author Blogging 101
  • Book Construction Blueprint
  • Book Design
  • Book Printing
  • Book Production
  • Book Reviews
  • Contributing Writers
  • Cover Design
  • E-Books & Readers
  • Editorial
  • Guest Posts
  • Interior Design
  • Interviews
  • Journey of a Book
  • Legal Issues
  • Marketing
  • Podcasts
  • Project Focus
  • Reports
  • Reviews
  • Samples
  • Self Publishing Basics
  • Self-Publishing
  • Social Media
  • Training
  • Video
  • Webinars
  • Writing
  • Home
  • About
  • Articles
  • Contact
  • Shop
  • Self-Publishing
  • Author Blogging
  • Sitemap
  • Fonts/Typography
Terms of Service
Privacy Policy
Comment Policy
Guest Author Guidelines
Why?
"Writers change the world one reader at a time. But you can't change the world with a book that's still on your hard drive or in a box under your bed. This blog exists to help you get that book into people's hands."
—Joel Friedlander

Copyright Marin Bookworks © 2021

For permission to reprint articles from this site, please contact [email protected]

Marin Bookworks 369-B Third Street #572 San Rafael, CA 94901