Word Cleaner – Doc To HTML Converter Software to batch convert word files to HTML
Convert your text files to HTML and clean up the code
Word Cleaner 5.1.1 Changes – Released 14th Sept 11
- [bug] WC Standard version and right click menu options were not working
- [bug] Issue uninstalling WC shell ex dll – not removed due to explorer lock
- [bug] Output images folder the same as doc folder and Test (file name).doc issue
Word Cleaner 5.1 Changes – Released 06th Sept 11
- [bug] Apollo 10 and WC page split code does not work.
- [change] New Apollo 10.4 engine version.
- [change] Tweak Clean up MS Word html template commands in order to clean better, update WC MS Word templates.
- [new feature] Convert to Epub e-book format.
- [new feature] Numbered html entities.
- [bug] Converting text emails/urls does not work properly in WC templates.
- [limitation issue] Issue with license file and its location + limited permissions.
- [bug] Copy to clipboard not working with body only, entities etc.
- [bug] Delete tag removing any tags starting with a given char, e.g. "u" will remove all tags starting with the char "u".
- [new feature] Add an option to enter PRO serial – register again in WC.
- [new feature] WC export to CSV format.
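The "Numbered html entities" feature above can be illustrated with a minimal Python sketch. It is not Word Cleaner's own code: the function name `to_numbered_entities` is hypothetical, and the sketch assumes that "numbered" means decimal character references such as `&#233;`.

```python
def to_numbered_entities(text: str) -> str:
    """Replace every non-ASCII character with a decimal character reference.

    Assumption: this mirrors what a "numbered entities" option would emit;
    the changelog does not describe Word Cleaner's actual behaviour in detail.
    """
    # The "xmlcharrefreplace" error handler turns each character that cannot
    # be encoded as ASCII into a numeric reference of the form &#NNN;
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


print(to_numbered_entities("caf\u00e9 \u00a9 2011"))
```

Plain ASCII text passes through unchanged, so the function is safe to run over markup that is already ASCII-only.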
Word Cleaner 5.0.1 Changes – Released 06th June 11
| Product Version | Category | Updated | Summary | Fixed in Version |
| Word Cleaner 5 | change | 2011-06-06 | Apollo will have Title and Description from MS Word doc in img alt attribute, Apollo update 10.1 | Word Cleaner 5.0.1 |
| Word Cleaner 4.7.2 | bug | 2011-06-06 | Modify metadata template code removing all meta tags | Word Cleaner 5.0.1 |
| Word Cleaner 5 | bug | 2011-06-06 | Browser preview right click menu options not working | Word Cleaner 5.0.1 |
| Word Cleaner 5 | bug | 2011-06-06 | Image preview of comparison chart not fully visible | Word Cleaner 5.0.1 |
| Word Cleaner 4.7.2 | change | 2011-06-06 | Clean MS Word html option will be disabled and turned off for Titan and Apollo | Word Cleaner 5.0.1 |
| Word Cleaner 5 | new feature | 2011-06-06 | Option to enable/disable right click menu | Word Cleaner 5.0.1 |
| Word Cleaner 5 | new feature | 2011-05-25 | ODT file format support | Word Cleaner 5 |
| Word Cleaner 4.7.1 | bug | 2011-05-24 | On Windows Vista/7 Help on template editor form does not work, gives an error | Word Cleaner 5 |
| Word Cleaner 4.7.2 | bug | 2011-05-23 | delete_all_tags bugs in Template management and in template processing code | Word Cleaner 5 |
| Word Cleaner 4.7.2 | change | 2011-05-16 | WC will check if Word instance is running and will connect to it, WC4 was creating new Word instance all the time | Word Cleaner 5 |
| Word Cleaner 5 | new feature | 2011-05-11 | WC end line Windows/Unix option | Word Cleaner 5 |
| Word Cleaner 5 | change | 2011-05-05 | Apollo engine updated | Word Cleaner 5 |
| Word Cleaner 4.7.2 | change | 2011-05-04 | New Syntax editor version update | Word Cleaner 5 |
| Word Cleaner 5 | new feature | 2011-05-04 | New error reporting form | Word Cleaner 5 |
| Word Cleaner 5 | new feature | 2011-04-28 | HTML entities option on encoding tab | Word Cleaner 5 |
| Word Cleaner 4.7.2 | bug | 2011-04-28 | When changing WC converting engine in template, icon in ribbon is not being changed | Word Cleaner 5 |
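As an illustration of the "end line Windows/Unix option" listed in the table above, here is a minimal Python sketch of line-ending normalisation. The function name and the `style` parameter are hypothetical and are not taken from Word Cleaner.

```python
def normalize_line_endings(text: str, style: str = "unix") -> str:
    """Convert all line endings in `text` to one style.

    `style` is a hypothetical parameter: "windows" gives CRLF, anything else LF.
    """
    # First collapse every variant (CRLF, lone CR) to LF so that no ending
    # is converted twice.
    unified = text.replace("\r\n", "\n").replace("\r", "\n")
    if style == "windows":
        return unified.replace("\n", "\r\n")
    return unified


html = "<p>one</p>\r\n<p>two</p>\n"
print(repr(normalize_line_endings(html, "windows")))
print(repr(normalize_line_endings(html, "unix")))
```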
Word Cleaner 4.7 Changes – Released 20 May 10
The main change is that converted images are now far higher quality.
[limitation issue] Doc 2003 files having text frames with images will not convert with high image quality – resolved.
[change] Optional image quality for WC – resolved.
[change] Do not load doc into preview when adding a file to the convert files list – resolved.
[bug] WC not exiting on Exit word cleaner button click on Report Error form – closed.
[bug] Empty custom file name WC error message – resolved.
[change] Internal Converter and MS Word image quality issue – resolved.
[bug] Item panel in Ribbon has display and scrolling issues – resolved.
[bug] MS Word transparent PNG saved as JPG in WC showing as black images – resolved.
[change] Remove 600 DPI value from Image DPI – 600 is not even supported by MS Word, max is 440 – resolved.
[change] Change notification message box to log alert info when manually changing output file name – resolved.
[bug] Image DPI difference issue when converting with Internal Converter & MS Word – resolved.
[third party problem] Image quality when converting with MS Word – DPI is not working – resolved.
[change] Change image output path \ to / char which is URL proper separator char – resolved.
[change] WC shows many error notifications about invalid output path – resolved.
[bug] When converting file names with ( – special chars WC gives reg exp error – resolved.
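The last item above (a regular expression error for file names containing "(") is a common failure mode when a file name is used directly as a pattern. The changelog does not say how Word Cleaner fixed it. The Python sketch below only shows the general technique of escaping the name first, and `find_file_reference` is a hypothetical helper.

```python
import re


def find_file_reference(html: str, file_name: str) -> list[int]:
    """Return the start offsets of every literal occurrence of `file_name`."""
    # re.escape neutralises characters such as "(" that would otherwise be
    # read as regex syntax.
    pattern = re.compile(re.escape(file_name))
    return [match.start() for match in pattern.finditer(html)]


# Used unescaped, a name with an unbalanced "(" is not a valid pattern.
try:
    re.compile("Report (draft.doc")
except re.error as exc:
    print("unescaped pattern fails:", exc)

print(find_file_reference('<img src="Test (1).png">', "Test (1).png"))
```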
Word Cleaner 4.6.1 Changes – Released 21 Dec 09
Two new templates added:
- Convert to Word Filtered HTML and remove formatting
- Convert to Word Filtered HTML
New regex sandbox test feature added. Users can now run regular expressions from the Code Editor (main window) or from Template management:
1) Find/replace easy mode: the find and replace values from the text boxes are pasted into the boxes in the regexp form
2) Advanced editor: the user can select text in the template and it will be used as the regexp to match (first text box)
Bug fixes:
- Issue when converting files with .htm extension via MS Word when output file name is the same (extension could be different)
- Convert emails/urls template code has issues when input html already has <a> tag links in html
Word Cleaner 4.6 Changes – Released 16 Nov 09
===================================
- [bug] Some issues on 64bit Vista/W7 – resolved.
- [change] HTML input file has issues with CSS/Images/Metadata etc – resolved.
- [bug] Typo ConvertedFilesFoder (missing l) – resolved.
- [new feature] Save document as TXT for Internal Converter and MS Word + Page Split – resolved.
- [idea] Check word 2003 and compare it with 2007 html – resolved.
- [change] Convert html span with bold/italic/underline style to <strong> and tags – resolved.
- [change] Redesign WC Template Editor window to new MS Word option Style (white) – resolved.
- [new feature] Send template via email option and copy template to clipboard option – resolved.
- [change] Template based body only option causing issues with output to txt and other places – resolved.
- [bug] Find/Replace \r \n bug fix in Easy editor mode – resolved.
- [new feature] Add 2 new commands to delete empty tags + UI for Template Easy mode – resolved.
- [change] Add custom WC exception for preview in template editor when file is used by other app like MS Word – resolved.
- [bug] delete_tag_attribute bug, not removing particular attributes with " ' ' " – resolved.
- [new feature] Send error document and current template on error – resolved.
- [change] Layout Changes for Easy Editor – Moving controls for better experience – resolved.
- [change] Move option to backup html before cleaning – resolved.
- [change] Reading metadata from word files when converting with Internal engine – resolved.
- [change] Reconvert files without having to reset the file status – resolved.
- [new feature] CSS path options – let the user specify a relative path, e.g. the same folder as the html file or a sub folder, e.g. /css/css file – resolved.
- [new feature] Add custom css link tag to easy editor mode – resolved.
- [change] Refactor Internal/MS Word converting code so that it can be used by Preview in Easy Mode – resolved.
- [bug] WC is changing doctype & html tag links for auto links option – resolved.
- [bug] Row index provided is out of range exception – resolved.
- [new feature] Multiple line find and replace feature – resolved.
- [new feature] Add option to embed images in html – works only for Internal engine – resolved.
- [bug] removes apostrophes from the text – resolved.
- [bug] WC will not create file or it will be empty when XHTML validation encounters major issues – resolved.
- [bug] Exception when reading MS Word docx properties – resolved.
Version 4.5.2 – Released 18th June 2009
A minor update:
New command: find_and_replace_regexp_text()
Here is an example of how to use this command. Take the text 'A cat eats a mouse, I like my cat'. If you want to change only the first 'cat' to 'dog', you can use the new command: find_and_replace_regexp_text('A cat eats','cat','dog');
It will change the text to 'A dog eats a mouse, I like my cat'. Only the first 'cat' is changed to 'dog'.
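The behaviour described above can be mimicked with a short Python sketch. This is a stand-in written for illustration, not Word Cleaner's implementation. It assumes the first argument is a regular expression that selects a region, and that the second and third arguments are a literal find/replace pair applied only inside that region. The extra `text` parameter is also an assumption, since the real command works on the document being converted.

```python
import re


def find_and_replace_regexp_text(text: str, scope_pattern: str,
                                 find: str, replace: str) -> str:
    """Replace `find` with `replace`, but only inside matches of `scope_pattern`."""
    # For every region matched by scope_pattern, do a plain string replace
    # inside that region and leave the rest of the text untouched.
    return re.sub(
        scope_pattern,
        lambda match: match.group(0).replace(find, replace),
        text,
    )


result = find_and_replace_regexp_text(
    "A cat eats a mouse, I like my cat", "A cat eats", "cat", "dog"
)
print(result)  # A dog eats a mouse, I like my cat
```

The second 'cat' is left alone because it falls outside the region matched by 'A cat eats'.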
[bug fixed] Autocomplete list of commands at advanced editor was not autocompleting – resolved.
[bug fixed] First selected template not saved on WC exit – resolved.
Version 4.5 – Released 06th May 2009
This is a major update, all users are advised to update.
With this version of Word Cleaner we have been working hard to make the software easier to use, with a more user friendly interface. The main change is that all settings (images, CSS etc) are now at the template level, not the global level. We have also introduced an easy template editor that makes it very easy to select the options you need. The idea behind these changes is that you can easily create a template to suit your needs.
New features in this version:
- New easy to use template editor
- All settings are now set at the template level, not on a global level
- Most options can now be selected by using simple tick boxes
- Improved template preview, browser and code preview are now combined
- Brand new code editor, which is more stable and has more features:
  - Notice the icons at the top of the code editor
  - Keyboard shortcuts for copy, paste etc now work
  - It's now easier to select all the code and copy it to the clipboard
  - Load & Save encoding modes – ANSI (system's default code page), ASCII and UTF8
- New right click menu on the file list with common functions:
  - Convert file
  - Remove file
  - Reset file
  - Open file location
  - Copy converted html to clipboard
- Added support for meta data, e.g. author, description, keywords etc.
- Removed the "File converted!" message box, to save you time by not having to click it
- New option to split pages by H1 tag
- New option to only give body content, with no header. This is useful when you need to copy into CMS systems.
- Improved image support, you can now specify relative directories for images.
- New template manager tab in the main interface.
- New command line commands:
  - Command to change file name
- [bug] TX15 Sp1 has issues on 64bit Vista/W7 – resolved.
- [change] HTML input file has issues with CSS/Images/Metadata etc – resolved.
- [bug] Typo ConvertedFilesFoder (missing l) – resolved.
- [new feature] Save document as TXT for TX and MS Word + Page Split – resolved.
- [idea] Check word 2003 and compare it with 2007 html – resolved.
- [change] Convert TX html span with bold/italic/underline style to <strong> and tags – resolved.
- [change] Redesign WC Template Editor window to new MS Word option Style (white) – resolved.
- [new feature] Send template via email option and copy template to clipboard option – resolved.
- [change] Template based body only option causing issues with output to txt and other places – resolved.
- [bug] Find/Replace \r \n bug fix in Easy editor mode – resolved.
- [new feature] Add 2 new commands to delete empty tags + UI for Template Easy mode – resolved.
- [change] Add custom WC exception for preview in template editor when file is used by other app like MS Word – resolved.
- [bug] delete_tag_attribute bug, not removing particular attributes with " ' ' " – resolved.
- [new feature] Send error document and current template on error – resolved.
- [change] Layout Changes for Easy Editor – Moving controls for better experience – resolved.
- [change] Move option to backup html before cleaning – resolved.
- [change] Reading metadata from word files when converting with TX – resolved.
- [change] Reconvert files without having to reset the file status – resolved.
- [new feature] CSS path options – let the user specify a relative path, e.g. the same folder as the html file or a sub folder, e.g. /css/css file – resolved.
- [new feature] Add custom css link tag to easy editor mode – resolved.
- [change] Refactor TX/MS Word converting code so that it can be used by Preview in Easy Mode – resolved.
- [bug] WC is changing doctype & html tag links for auto links option – resolved.
- [bug] Row index provided is out of range exception – resolved.
- [new feature] Multiple line find and replace feature – resolved.
- [new feature] Add option to embed images in html – works only for Internal TX engine – resolved.
- [change] Using new components TX15, DNB 8.1 – resolved.
- [bug] removes apostrophes from the text (brian) – resolved.
- [bug] WC will not create file or it will be empty when XHTML validation encounters major issues – resolved.
- [bug] Exception when reading MS Word docx properties – resolved.