MS-Word is Not a document exchange format[1]1 [2]Jeff Goldberg original version: http://www.goldmark.org/netrants/no-word/attach.txt Typically you are getting this because you sent someone an email message using MS-Word or some other operating system or text-processing specific attachment. Alternatively, you may have placed MS-Word files on the web as the only means for getting at the document content. Contents [3]1 What's wrong with sending MS-Word files? [4]2 Alternatives [5]3 Where MS-Word is appropriate [6]4 Response to the ``it's the emergent standard'' refrain [7]5 History and related documents [8]5.1 Similar documents [9]5.2 Rants about MS-Word [10]5.3 Reaction so far [11]5.4 Shameless plug 1 What's wrong with sending MS-Word files? Requires proprietary software You are basically assuming that everyone has on their desktop the same software that you have. That often goes against the spirit of the Internet which is supposed to be about inter-operability of heterogeneous systems. That fact that one ``persistently predatory monopoly''[12]2 attempts to subvert that goal, doesn't mean that you should go along with it. Someone who sends me such mail is perfectly welcome to purchase for me a machine and software specifically so that I can read mail in that proprietary system. But I will still have the inconvenience of having to forward the file to a system I wouldn't normally use. Version problems Even for those who chose to use MS-Word, there are compatibility problems between various versions. Viruses MS-Word allows full macro-scripting. It is now the most common carrier for viruses. What this means is that embedded within a Word file can be a program which runs silently (or otherwise) on the recipients computer whenever they view the file. Are you happy with letting other people run programs on your machine? In one instance that I know of, a substantial portion of an MBA graduating class sent out résumés with a Word macro virus. I don't think that this helped their job prospects. But the particular business school had an official MS-Word policy. Size Often what would be just a few kilobytes of plain text is hundreds of kilobytes as a Word file. I find it interesting that MS-file browsers and emailers don't make it obvious to the sender how large particular files are. Prior version info Because of Word's system of doing version control, it is possible that recipients may see prior drafts of your document (which may contain confidential information). I've heard a number of ``friend of a friend'' stories about this sort of thing. In one case, a potential customer was given a quote for some product, and the quote was sent in an MS-Word file. When the customer viewed the version history, they found that a previous version of the document had been used for a quote to other customers, with much lower numbers. But since initially writing this, I have heard a number of first hand accounts. Some of which are below. Since I almost never read MS-Word documents sent to me, I will have to rely on the accounts of others. In a [13]Usenet news article, Alan Frame describes some of his experiences with this In the past, I've received MS Word documents from an agency, describing a job vacancy where they've refused to name the client - lo and behold, the document properties reveals all. And also Indeed, I've also seen an internal business proposal which appears to have originated at the supplier that the proponent was err, proposing. I have also received word from others saying, This regularly happens to me because I deal with public relations companies who always use the very latest spiffy version of Word and Powerpoint and seem to be totally unaware that not everyone does the same. Normally I junk these docs, but if I need them I view them ... and often see where corrections have been made... I have never seen anything really sensitive as a result of this, probably because most press releases aren't on very sensitive subjects. Usually I see comments like ``CLAIRE: should we describe what the possible treatment options might be?'', plus minor word-changes. But I live in hope. Charles Wankel posted a message concerning this to the [14]E-Media list of the [15]Academy of Management saying, I received a paper for an effort that I was an editor for from someone who had used a ghostwriter. The ghostwriter had had embedded her name in such a way that when I looked at the document in a source view I could see it with the dates that wrote, edited, and re-edited drafts of the document. Typically attached ``wrong'' to email While this is not strictly speaking a problem with MS-Word files, it is a related problem. People and systems that think that it is right to just send such things, seem to think that it is OK to send everything with the MIME Content-type of application/octet-stream and let the recipient work things out from the filename info that is also sent. That is a violation of the intent of the MIME standards, and indicates broken design for exchange of information. Word is not device independent I have been told that MS-Word documents will format differently depending on the specifics of the printer. This is not merely issues of printer resolution or color depth, but the actual formating of the document will differ. I was surprised to learn this. I had assumed that Word was ``What You See Is What You Get'', but it appears that I was mistaken about that. So it won't even achieve the goal of ensuring that your recipient sees things with all the formatting you see things with even if the recipient also uses MS-Word. Word isn't even good at what it is designed for Word produces probably the worst output and is the slowest and most tedious to work in of any document preparation system I've seen in the past 15 years. I find it remarkable that when people are presented a choice between a structural mark-up system (what you mean is what get) versus a visual mark-up system (what you see is all you get) people opt for the latter. For more on this point see secton [16]5.2. 2 Alternatives When talking about things sent by email it is important to distinguish between document exchange and message exchange. Message exchange is typically what one does by email. Making announcements or participating in a discussion, and many of the other things we typically do with email. For these plain text is the only reasonable thing. It is the safest, most portable and by far the most compact. It allows responses quoting portions, and has none of the dangers mentioned above. The small added value of the formating information isn't worth all of the problems. If you absolutely need to present the formating information for document exchange, then use a page description language like PDF. Alternatively, you can use RTF which is designed specifically for exchange of word-processor documents, and is a fixed open standard. Also consider using (standards compliant) HTML. Please note that I am not in any way advocating the use of HTML in ordinary email. It is grossly inappropriate for that for reasons that are beyond the scope of this document. 3 Where MS-Word is appropriate MS-Word is appropriate for document exchange among co-authors of a document who are all developing it and have agreed before hand to use MS-Word. If you have been referred to the document you are now reading, then the person who referred you to it probably doesn't consider themselves party to such an agreement, and having sent them an MS-Word document is inappropriate. 4 Response to the ``it's the emergent standard'' refrain Several people have responded with sophisticated ``network analysis'' essays about MS-Word being a de facto standard, and pointing out that even if the standard isn't the optimal one, it is better to go along with the standard anyway. My counter argument is two-fold: 1. Whether or not the argument about emergent standard holds for authorship (eg, ``I use Word because it is what my potential co-authors use'') has little bearing on what you use for document exchange. I use LAT[E]X for document preparation, but I distribute them as PDF.[17]3 So there may be an argument for using MS-Word even though it is inferior to other options, but that in no way suggests that MS-Word should be used for document exchange. 2. The second argument is an ethical one, and I start with an analogy. Over the past few years it has become fashionable in the US to drive some form of truck as a primary commuting/errands vehicle. There are many issues regarding that fashion, but for this analogy I would like to focus on two of them. When two vehicles collide the occupants of the lighter one are far more likely to suffer injury then they would if the had collided with an equally light vehicle. So when someone drives a truck, they are putting those in normal sized vehicles at an extra risk. The second property is similar. The headlights of the trucks are much higher off the ground than those of cars. Driving a car at night with one of these trucks close behind you is extremely annoying and possibly dangerous. In both of these cases, the drivers of the trucks don't experience the disadvantage of others driving trucks. In the first case, they too are in heavy vehicles, and in the second the driver is high enough off the ground to not be impaired by the headlights of other trucks. By the logic of the ``emergent standard'' advocates, the only way to deal with the truck problems I've described is to switch to driving a trunk oneself. The emergent standard argument might have some validity if the standards were arbitrary, but if some are particularly destructive to community as a whole, they should be opposed. Use of MS-Word for document exchange is simply bad network citizenship. Paraphrasing Juhapekka Tolvanen: using MS-Word is like smoking; using it for document exchange is like blowing smoke in everyone's face. 3. There is a third argument, closely related to the second: Do you want to be part of Microsoft's marketing effort? 5 History and related documents 5.1 Similar documents When I first wrote the first version of this document in March, 2001, it was because I not only was fed up with people sending me unwanted MS-Word documents, but because I was tired of explaining repeatedly why I objected to them. I wrote this to be part of a canned response. Being remarkably lazy, I didn't want to investigate and write this up if someone else had already written something. So I did a little bit of searching for documents like this. I knew form personal communicatin that while I am in a minority there is a substantial minority which feels exactly the same way. I expected that someone would have already written something like this document. I didn't find any when I looked, but clearly I didn't look carefully enough. I have since been informed of others that I've missed. I list them here, along with some which were written after my document. [18]plaintext: In praise of practical e-mail hygiene This is Martin Vermeer's essay. It covers the same points as mine but goes deeper into trying to persuade people to be better network citizens. [19]http://www.netby.dk/Oest/Europa-Alle/vermeer/plain.html [20]We can put an end to Word attachments This is an article by Richard M. Stallman advocating efforts like mine to discourage people from sending MS-Word documents. The article itself is aimed at those who already know that Word attachments are wrong. [21]http://www.gnu.org/philosophy/no-word-attachments.html [22]Miksi on typerää postittaa sähköpostin... As you can see, this detailed essay and analysis by Juhapekka Tolvanen is in Finnish. I don't read that language, but there are some useful links from that. He comes up with a very useful analogy, which I will rephrase more harshly: Using MS-Word is like smoking; emailing those files is like blowing smoke into other people's faces. [23]http://www.cc.jyu.fi/~juhtolv/mswordmail.html [24]MS-Word? nom obrigado A similar document to mine, available in Portugues and Galician, by Ramón Flores d'as Seixas. While this document is based on the others listed here, it also adds points about what makes a good document exchange format. It also discusses the values of standards of exchange in terms of establishing a level playing field. The Galician is pretty much readable to those who can read Spanish. [25]http://members.tripod.com.br/ramonflores/word/index.html 5.2 Rants about MS-Word The focus of this document has been on the misuse of Word for document exchange. It is geared toward MS-Word users to encourage them to send documents in other formats, even if they continue to use Word for document production. The arguments I've presented stand even if MS-Word were a good tool for document preparation. However, I'd also like to point to some documents which argue (correctly in my view) why MS-Word is a bad choice of document preparation system and not just a bad choice of document exchange format. [26]Word Processors: Stupid and Inefficient by Allin Cottrell discusses what is wrong with What You See is All You Get systems using visual mark-up, as opposed to the far more reasonable structural system where you separate the tasks of controlling the appearance from the task of writing the content. [27]No Proprietary Binary Data Formats by Sam Steingold. This discusses the dangers of keeping important data in formats which require restricting and licensed software to recover. MS-Word is a proprietary and secret document format. You are trusting your future access to you own documents to the whim an a persistent monopolist. [28]http://www.podval.org/~sds/data.html 5.3 Reaction so far As far as I can tell my campaign has met with little success so far (January 2002) other than a few people taking some care to send me RTF documents instead of MS-Word documents, with no change in their general practice. If I get any response at all it is typically ``Well, you're right but I'm going to stick with my current practices.'' I find that disappointing, particularly when people acknowledge the correctness of the ethical argument I make. 5.4 Shameless plug If you have found this interesting, you may wish to see other netrants I have at [29]http://www.goldmark.org/netrants/. _________________________________________________________________ Footnotes: [30]1Among others, I would like to thank Alan Frame, Dave Reader, Pete Mitchell and Juhapekka Tolvanen for their comments on an earlier draft. Your name can be added here as well. Just provide useful comments and suggestions. This document is available in several formats from [31]"http://www.goldmark.org/netrants/no-word/" [32]2In the words of a federal judge. [33]3Using LAT[E]X does have exactly the cost described by those who raise the ``de facto standard'' argument: I find myself limited in co-authors to a subset of clueful, intelligent and network cooperative individuals. _________________________________________________________________ File translated from T[E]X by [34]T[T]H, version 3.00. On 9 Feb 2002, 11:54. References 1. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFtNtAAB 2. http://www.goldmark.org/jeff/ 3. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc1 4. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc2 5. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc3 6. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc4 7. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc5 8. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc5.1 9. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc5.2 10. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc5.3 11. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tth_sEc5.4 12. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFtNtAAC 13. news:1ew1ifh.cerbxl1boqfiyN%25alan.frame%40acm.org 14. http://www.aom.pace.edu:81/guest/RemoteListSummary/emedia_l 15. http://www.aom.pace.edu/ 16. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#sec:wordrants 17. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFtNtAAD 18. http://www.netby.dk/Oest/Europa-Alle/vermeer/plain.html 19. http://www.netby.dk/Oest/Europa-Alle/vermeer/plain.html 20. http://www.gnu.org/philosophy/no-word-attachments.html 21. http://www.gnu.org/philosophy/no-word-attachments.html 22. http://www.cc.jyu.fi/~juhtolv/mswordmail.html 23. http://www.cc.jyu.fi/~juhtolv/mswordmail.html 24. http://members.tripod.com.br/ramonflores/word/index.html 25. http://members.tripod.com.br/ramonflores/word/index.html 26. http://www.ecn.wfu.edu/~cottrell/wp.html 27. http://www.podval.org/~sds/data.html 28. http://www.podval.org/~sds/data.html 29. http://www.goldmark.org/netrants/ 30. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFrefAAB 31. http://www.goldmark.org/netrants/no-word/ 32. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFrefAAC 33. file://localhost/home/goldmark/public_html/netrants/no-word/attach.html#tthFrefAAD 34. http://hutchinson.belmont.ma.us/tth/ === Converted from HTML to text by lynx ===