Google for HTML Tidy.
I use it for all the M$ docs that I convert to HTML. Basically, I take my Word or Excel file and save it as HTML from within Word or Excel. I then open HTML Tidy and load the config file I created to strip out all the bloat from M$ HTML docs. Then I load the M$ HTML file, click the Tidy! button, click the Replace button, look through the resulting code to verify that all of the bloat is gone, and save it. It strips out everything but the very basic tags like <p>, <b>, <i>, <h1>, <table>, <img>, etc. It's then very easy to copy the code and paste it into the WYSIWYG editor. I love it.
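If you want to see roughly what that kind of cleanup does, here is a minimal sketch in Python. To be clear, this is not HTML Tidy and not my config file; it is a stand-in that uses only the standard library's html.parser to keep a whitelist of basic tags and throw away everything else. The names KEEP, KEEP_ATTRS, BloatStripper and strip_word_bloat are made up for illustration, and the whitelist itself is an assumption you would adjust to taste.

```python
from html import escape
from html.parser import HTMLParser

# Assumed whitelist of "basic" tags to keep; everything else is dropped.
KEEP = {"p", "b", "i", "h1", "h2", "h3", "table", "tr", "td", "th",
        "img", "a", "ul", "ol", "li", "br"}
# Assumed per-tag attribute whitelist; all other attributes are dropped.
KEEP_ATTRS = {"img": {"src", "alt"}, "a": {"href"}}
# Elements whose text content is thrown away along with the tag.
SKIP_CONTENT = {"style", "script", "title"}
# Elements that never get a closing tag.
VOID = {"img", "br"}


class BloatStripper(HTMLParser):
    """Hypothetical helper: re-emits only whitelisted tags and plain text."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside <style>, <script> or <title>

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or tag not in KEEP:
            return  # drops <span>, <o:p>, <font>, etc., but keeps their text
        allowed = KEEP_ATTRS.get(tag, set())
        parts = []
        for name, value in attrs:
            if name in allowed:
                safe = escape(value or "", quote=True)
                parts.append(' %s="%s"' % (name, safe))
        self.out.append("<%s%s>" % (tag, "".join(parts)))

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or tag not in KEEP or tag in VOID:
            return
        self.out.append("</%s>" % tag)

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(escape(data, quote=False))

    # Comments (including Word's conditional comments) are ignored because
    # the default handle_comment does nothing.


def strip_word_bloat(html_text):
    """Return html_text with only the whitelisted tags left in place."""
    parser = BloatStripper()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out)


if __name__ == "__main__":
    sample = ('<p class=MsoNormal style="margin:0">'
              '<span style="font-family:Arial"><b>Hi</b> there<o:p></o:p>'
              '</span></p>')
    print(strip_word_bloat(sample))
```

As with the Tidy workflow, you would still want to look through the result before pasting it into the WYSIWYG editor, since a whitelist this blunt can drop something you actually wanted.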
Shane