Write in Latex, submit in an absurd format.

The point of this page is that any work that may be converted into PDF can be submitted as an m$-word document; this might be useful to fulfil some organisations' idiotic requirement of this file-format. One can split the pages of the PDF, convert them to EPS, and include them as full-page `graphics' in an ms-word document... I've used this trick twice, and got away with it, though I usually try to avoid in the first place any organisation that considers such a silly restriction. There are examples of this page (as of September 2008) made into PDF by Latex and then into an ms-word file: see files here.


Note: There are (apparently, scanning the web) plenty of converters or `importers' available, to try to get a word-processor to swallow a Latex source-file and make it into its own format. One example is TeX4ht, which provides the command htlatex and some abbreviations of its calling for particular conversions, such as oolatex that converts a dvi file to Open Document (e.g. for use in OpenOffice, which can also export to many other formats). I've had no fun with this, and have fallen back to latex2rtf followed by plenty of work, when experimenting. These converters are not covered further here: the point is that one wants one's nicely typeset work to be preserved, but is obliged to submit in a form that cannot do that except as a (vector)image representation, and wants to do a quick and reliable conversion without then checking every cross-reference, font, etc., as seems prudent when converting in a way that has to interpret one format into a very different one. One would expect that latex TO wordproc is better than wordproc TO latex in the matter at least of headings etc., since so many word-processor users use appearance-related rather than structure-related formatting; but still, there are likely to be imperfections in how different packages' effects are interpreted -- these conversions always have a problem. The suggested method doesn't have problems in this way: its problems are non-modifiability, lack of document structure, and possibly weaker appearance of the type when printed.


Recipe:  If one really wants to submit a document to a misguided organisation that demands a proprietary `word-processor' format, without having to endure the hardship of converting nicely typeset work from Latex (or indeed from any other program that can generate PDF or PS), the following may be of help:

Note that for printing or PDF-export of the .doc file, the result can be very good, when the EPS contained proper vector representations of the pages' text, and these have been preserved.



Incredible though it may seem, even some moderately `technical' (i.e. maths-using) conferences or journals! now require submission of publications in ms-word format. For example, IEEE Transactions on Dielectrics and Electrical Insulation has this requirement (a reason given was that is is necessary for speeding up submission, though it's quite incomprehensible how this requirement is necessary for that aim), and the International Symposium on High-Voltage Engineering in 2007 listed this requirement on its webpage (a reason given was that the `conference software' being used had this limitation, and this is reasonable as the software is used by such-and-such percent of conferences -- note no mention of scientific conferences...).

The rank stupidity of these requirements has several aspects to its causes and consequences. As more and more people think themselves conversant with computers, the idea becomes ever less natural that one should, as an editor or organiser, expect to consult with someone who might actually have a clue about available programs and formats and their advantages and disadvantages. People get used to the idea of starting a particular program, bundled with their computer, every time they're going to write anything, from a quick note to a paper or thesis. They expect everyone else will too. They haven't tried other concepts of typesetting, to find out their relative ease, nor have they thought of what a crazy situation we're in when much of the world is paying ludicrously high fees for using a proprietary writing-program when there exists an excellent Free alternative (OpenOffice) and other quite different, and generally preferable, ways of getting a typeset paper. Since the current situation is so often that most people do have a particular operating system and word-processor, and haven't explored much further, there's a good chance that many will feel satisfied with these restrictions. Many people will happily swallow almost any such requirement without being bothered by its folly and lack of reasoning. Feedback happens only weakly.

The bad points about a requirement of msword as a format for submission are at least the following. It is specific to one word-processor, or arguably one or so others (openoffice is pretty good at saving to this format), while a print-type format such as pdf can be generated by pretty much any program that one might use to write a document. This specificity encourages people to buy the right to use this highly priced junk, which is particularly bad when they are in situations where money is a problem. Even if the availability were no trouble, the `WYSIWYG' word-processor is in many ways a bad tool for the job of writing most technical works; a mark-up system such as Latex allows easy input of equations without add-on programs or lots of mouse-clicks, it handles references easily, and it doesn't move text around all the time as one types -- the content is the focus, and typesetting it all comes at the end, and (not least important), its output looks pleasing rather than atrocious. But even if authors want yet another program, e.g. some other more WYSIWYG thing which they're used to, there needs to be a really pressing need for a specific word-processor file format in order to inflict the requirement that all users try using a particular program that some of them probably hardly know or intensely dislike! Apart from this, m$-office formats are well-known for their inability to show a document in the way it was seen on the author's computer -- plenty of conferences' presentations and proceedings bear witness to this, with bizarre movements of figures, extra characters (e.g. Greek letters) replaced with squares or dots, certain corporate logos replaced with funny squiggles, etc. -- PDF is at least a much less unreliable format, even if not perfect.

Is there any reason why one might see an advantage to m$-word format, from the point of view of a clued-up editor of a journal? Not having to download so-called `acrobat reader' (if working on platforms that don't already have plenty of PDF readers) might be a slight boon to someone with a modem... Being able to make large changes to the document might be desired, but I see very little justification for this -- the job of the editor and reviewers is to report their views, not to send back a `corrected' version. Available PDF manipulation programs allow PDF files to have suitable headers, footers, and combinded page-numbers inserted, as is shown by all the many conferences and journals that have PDF submission and yet achieve all of these miracles; having one copy of such a program, even if it's a `licensed' sort, can surely be managed by a journal or conference. In short, I see no compelling reason why PDF should not be an accepted format; people just don't realise what a lot more versatile it is in terms of which programs could produce it, and what can be done with it at the editing side. Allowing PDF needn't mean limiting other formats -- one could still be allowed to submit ms-word if that were considered desirable for the authors; there are, anyway, free websites that offer conversion services.

Some links. I wouldn't claim complete agreement, but there's a good deal of truth in them. MS-Word is Not a document exchange format, and Word Processors: Stupid and Inefficient.


Page started: 2008-09-20
Last change: 2010-02-06