Word to HTML Conversion

They have: 30 posts

Joined: Mar 2001

Okay folks, I'm spending way too much time converting word files to HTML. I've tried a number of converters, but none really offers the combination that I want:

--W3C//DTD HTML 3.2 or at least the option to keep things simple
--Keep spacing and formatting of text as best as possible
--carry-over links and e-mails/make URL and e-mail addresses linked.

Is there anything out in the market that can do the above?

How do you guys handle multiple page Word documents that need to be formatted for HTML?

Appreciate any and all responses, have a great weekend!

-B

Megan's picture

She has: 11,421 posts

Joined: Jun 1999

Dreamweaver has a command called "clean up Word HTML" so you might want to try that. You can also "clean up HTML" generally, or convert your document to 3.0 browser compatible or xhtml. Oh, and "apply source formatting" will clean up the spacing and formatting and stuff. You can also customize the settings for all of those commands.

He has: 1,016 posts

Joined: May 2002

LOL!, Megan, welcome to the DWA (DreamWeaver Anonymous). Please stand up and share..

"My name is Megan. I'm a DreamweaverHolic"

Laughing out loud

He has: 1,380 posts

Joined: Feb 2002

why are you using word to create your documents? why not just code the HTML directly?

Megan's picture

She has: 11,421 posts

Joined: Jun 1999

Sometimes you might get documents given to you from other people who may have created them in bad editors like that. It happens.

Actually, zollet, I happened to be using those commands right as I was typing that - in my case to clean up 97-era code and convert to xhtml. The "Clean up HTML" command that lets you specify a list of tags to get rid of entirely. Nice (byebye !)

He has: 1,016 posts

Joined: May 2002

You don't have to tell me, I've been using Dreamweaver for a couple of years now Laughing out loud

Megan's picture

She has: 11,421 posts

Joined: Jun 1999

Oh, I know, I just thought that might be a helpful thing for others to be aware of. It sure is better than doing a s/r for ever possible combination fo font tags!

Jack Michaelson's picture

He has: 1,733 posts

Joined: Dec 1999

Beanboy,

maybe this helps, it's a free online document convertor:
http://wheel.compose.cs.cmu.edu:8001/cgi-bin/browse/objweb

ciao,

He has: 1,380 posts

Joined: Feb 2002

well if you want to put documents online, then make an investment to change them to pdf files. then you can make a regular webpage, and then have a link to the pdf files...much easier

Suzanne's picture

She has: 5,507 posts

Joined: Feb 2000

Kyle, while it's easier, it's not actually the best idea. PDF files are great for things that people will be PRINTING off, they really aren't that good for viewing.

However, in your suggestion is an inkling of a good idea -- Acrobat is a powerful program, you can use it to convert the Word file to a marked up PDF file (if you use Word correctly), and then use a free plugin for Acrobat to convert that PDF file to xml or html.

He has: 1,380 posts

Joined: Feb 2002

ah hah!

They have: 30 posts

Joined: Mar 2001

Thanks for the replies folks. I have Dreamweaver 3, and it pukes on my page code which was created in Homesite 4.5. Yes, it works fine on everything from Netscape 4.0 and up, cross-platform. Also doesn't like the Word 2000 files I get.

As far as the content goes, all of these Word documents are being handed to me. Many times they go up as a PDF only, but there's also many instances where it also needs to go into HTML form for search engine, screen readibility and navigation issues.

I've tried the PDF to HTML converter, but usually got poorly formatted HTML in return. Haven't looked into tagging the Word document before sending it over.

Looks like I'll be checking out Dreamweaver 4 and see how it handles my HTML. At the very least, I'll dump the Word content into Dreamweaver, and then place it in my template.

Thanks for all of the comments!

-B

Suzanne's picture

She has: 5,507 posts

Joined: Feb 2000

The thing with the pdf to html converter is it requires the Word document use styles correctly. Since almost no one does that...

Good luck!

Megan's picture

She has: 11,421 posts

Joined: Jun 1999

Quote: Originally posted by Beanboy
Looks like I'll be checking out Dreamweaver 4 and see how it handles my HTML. At the very least, I'll dump the Word content into Dreamweaver, and then place it in my template.

Thanks for all of the comments!

-B

Just so you know, my experience referred to above was with Dreamweaver MX - I'm not sure if v. 4 has the same features, although now that I think about it I'm pretty sure there was a word > HTML command in there. I actually had to run this today - something like 1500 microsoft styles removed from a 10 page document! It did a great job, except that it doesn't remove inline styles, that's the only problem.

Edit: I've been really happy with how DW MX has handled my code - and I'm normally very picky about that.

They have: 30 posts

Joined: Mar 2001

Hmmm, downloaded the latest verion of Dreamweaver 6.0 and it puked on the Word 2000 created HTML documents. Giving me an error:

At line 644 of "C:\Dreamweaver MX\Configuration\Commands\Clean Up Word HTML.js":ReferenceError: MSG_metaFileError is not defined.

Still puking on my template as well. Tried creating a simple Word 2000 document from scratch with a few sentences in it, still puked on loading the HTML version of it. Using the demo version of MX...

Hmmm,
-B

Megan's picture

She has: 11,421 posts

Joined: Jun 1999

That's funny, because it worked perfectly for me the other day. Maybe zollet will have some better advice - he knows more about DW than I do..

The Webmistress's picture

She has: 5,586 posts

Joined: Feb 2001

When I was using the trial version of DW MX I kept getting error messages liek the one you did but now I have the full version I don't get any errors so it must be a bug in the trial or something.

Julia - if life was meant to be easy Michael Angelo would have painted the floor....

Want to join the discussion? Create an account or log in if you already have one. Joining is fast, free and painless! We’ll even whisk you back here when you’ve finished.