Consulting

Results 1 to 6 of 6

Thread: Corrupted word doc - document.xml corrupted

  1. #1

    Corrupted word doc - document.xml corrupted

    Word document containing full semester uni work of my friend. She had NO backups and file was kept on USB key (hard to believe but true).
    I have tried a number of recovery tools including onlinerecovery.com, DataNumen, Corrupt Docx etc. They all fail.
    I tried renaming as .zip and extracting the document.xml - no luck.
    tried hex editor to see if I could see any text - nothing.
    I'm not interested in recovering the images in the file (got those ok). Just need the text if possible. (or point me to tools which might work?)
    Thank you very much.
    Last edited by esmond; 11-10-2015 at 02:54 PM.

  2. #2
    VBAX Sage
    Joined
    Apr 2007
    Location
    United States
    Posts
    8,729
    Location
    https://www.piriform.com/recuva

    I've had good luck with Recuva free version to see what files that have been deleted are still recoverable

    If you can run it on the USB, you might find an earlier or temporary version that was deleted, but still lying around (assuming she hasn't written over it)

    Sounds like if a hex editor doesn't see any text in the file, then there's no text in the file

    How big is/was the file?
    ---------------------------------------------------------------------------------------------------------------------

    Paul


    Remember: Tell us WHAT you want to do, not HOW you think you want to do it

    1. Use [CODE] ....[/CODE ] Tags for readability
    [CODE]PasteYourCodeHere[/CODE ] -- (or paste your code, select it, click [#] button)
    2. Upload an example
    Go Advanced / Attachments - Manage Attachments / Add Files / Select Files / Select the file(s) / Upload Files / Done
    3. Mark the thread as [Solved] when you have an answer
    Thread Tools (on the top right corner, above the first message)
    4. Read the Forum FAQ, especially the part about cross-posting in other forums
    http://www.vbaexpress.com/forum/faq...._new_faq_item3

  3. #3
    In that case, send me the corrupt word doc. My email is slchcw(at)yahoo.com . I can help to analyze and repair the file for you manually, for free.

  4. #4
    VBAX Newbie
    Joined
    Nov 2015
    Posts
    1
    Location
    The DOCX file format is just a collection of XML based layout files and other files like images packaged into one using standard "zip" compression.
    The Word document does not open in Word and only shows a generic error on my Win7 SP1 PC with Office 2003 plus the Office 2007-2010-2013 file format converter installed.
    Here's the details of the web page error I get when I view the extracted "document.xml" in the "XML Editor" that is part of MS Office and opens it with color-coded tagging and properly indented lines within Internet Explorer.
    Message: An invalid character was found inside an entity reference.
    Line: 2
    Char: 1584323
    Code: 0
    URI: file:///C:/Documents and Settings/Bill/My Documents/Downloads/Reflective-Journal-Submission-/word/document.xml
    The XML page cannot be displayed
    Cannot view XML input using XSL style sheet. Please correct the error and then click the Refresh button, or try again later.
    When I scroll to the end I see this where it is unable to display any more:
    An invalid character was found inside an entity reference. Error processing resource 'file:///C:/Documents and Settings/Bil...
    <w:b/><w:bCs/><w:sz w:val="52"/><w:szCs w:val="52"/></w:rPr><w:tab/></w:r><w:r ...

  5. #5
    I guesseventhis postwill help you onthis authoritative source. https://community.office365.com/en-us/f/155/t/255386

  6. #6
    Thank you guys for your feedback and help. Sorry, that long time did not respond. This is because the issue was solved and I forgot to tell.) Thank you!

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •