Microsoft Office Word

Even if many people are complaining about Microsoft Office and are looking for (online) alternatives, in 2017, there were 1.2 billion (88 percent of the market share) users worldwide using Word, especially professionals.

The Microsoft proprietary DOC format was the default format used – and only supported – by Microsoft Word from 1997 to 2003.

The pressure of government agencies and the open-source world pushed Microsoft to open up its Office formats in the early 2000s. Word, Excel, and PowerPoint documents became .docx, .xlsx, and .pptx, the « X » being for XML.
By 2007, the DOCX file format became part of Office 2007, and it is now the de facto standard for text documents.

However, despite this new standard, the DOC file format is still widely used worldwide: think about all those .doc files archived by companies of all industries!

Binary file format and OpenXML

A DOC document is a non-text file containing all the information related to text formatting and other relevant data.
A DOCX file is a zip with all the XML files associated with the document, thus producing smaller, lighter files that are easier to read and transfer than DOC. Image support is also more reliable with a better display and higher fidelity. It is also less sensitive to file corruption and viruses.

The DOC file format’s proprietary nature makes it difficult for external software solutions to support it in their applications. The open nature of the accurately named Office Open XML standard means that it can be read by any word processor, including online tools like Google Docs.

The GdPicture.NET Document Converter

The GdPicture.NET SDK includes the GdPictureDocumentConverter class, which provides many different methods and properties for document conversion of 100+ document/image formats.

This class also provides methods to manage parameters like font, margin, page range, or image quality.