Why is the found text in DOCX and XLSX files surrounded by code or tags?

0 votes

I've been using Agent Ransack without problems for a few weeks, but now all hits seem to include code or tags from the file rather than just a summary of the hit.

I'm not sure if I have changed a setting but I can't seem to get rid of the code. I have included an example below:

<xml:space="preserve">Please provide information </w:t></w:r>
<w:r w:rsidR="003B507C" w:rsidRPr="00357C75"><w:t>on</w:t></w:r><w:r w:rsidRPr="00357C75"><w:t>:</w:t></w:r></w:p><w:p w:rsidR="0038325F" w:rsidRPr="00357C75" w:rsidRDefault="00D76793" w:rsidP="007524E2"><w:pPr><w:numPr>
<w:ilvl w:val="0"/><w:numId w:val="3"/></w:numPr><w:spacing w:after="120"/></w:pPr><w:r w:rsidRPr="00357C75"><w:t>Q1.1</w:t></w:r>
<w:r w:rsidR="00896345" w:rsidRPr="00357C75"><w:t>: Your solution to recruitment (max 1 page)</w:t>
</w:r></w:p><w:p w:rsidR="001064FC" 
asked Oct 17, 2014 by Lawpf2001 (70 points)
edited Oct 17, 2014 by Lawpf2001

1 Answer

+1 vote
 
Best answer

You've probably switched OFF the Office/PDF documents option in the Options tab:

Office/PDF documents

If that option is switched OFF, Agent Ransack will search the raw data of the file.
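To illustrate why raw data looks like the example in the question: a DOCX file is a ZIP package whose main text lives in word/document.xml, with the visible words wrapped in <w:t> elements. The Python sketch below is only an illustration of that layout, not how Agent Ransack or the Office IFilters actually work. The helper names docx_plain_text and make_sample_docx are hypothetical, and the sample package contains only the one XML part, so it is not a complete DOCX that Word would open.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the w: prefix in word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_plain_text(source):
    """Return the visible text of a DOCX, one line per paragraph.

    `source` may be a file path or a file-like object.
    Simplification: paragraphs nested inside other paragraphs
    (e.g. text boxes) are not treated specially.
    """
    with zipfile.ZipFile(source) as package:
        raw_xml = package.read("word/document.xml")
    root = ET.fromstring(raw_xml)
    paragraphs = []
    for p in root.iter(f"{{{W_NS}}}p"):
        # Each <w:t> holds one fragment of text; join the fragments per paragraph
        runs = [t.text or "" for t in p.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)


def make_sample_docx():
    """Build a tiny in-memory package holding only word/document.xml."""
    body = (
        f'<w:document xmlns:w="{W_NS}"><w:body>'
        '<w:p><w:r><w:t xml:space="preserve">Please provide information </w:t></w:r>'
        '<w:r><w:t>on</w:t></w:r><w:r><w:t>:</w:t></w:r></w:p>'
        '<w:p><w:r><w:t>Q1.1</w:t></w:r>'
        '<w:r><w:t>: Your solution to recruitment (max 1 page)</w:t></w:r></w:p>'
        '</w:body></w:document>'
    )
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as package:
        package.writestr("word/document.xml", body)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(docx_plain_text(make_sample_docx()))
```

Running it on the sample prints the two paragraphs without any tags, which is roughly the kind of summary a document filter is expected to hand to the search tool.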

Other things to check:

  • The documents aren't password protected
  • If you are running a version of Windows prior to Windows 8, or don't have Office 2010 (or higher) installed, you should install the Microsoft Office 2010 Filter Packs.
answered Oct 17, 2014 by dave (53,670 points)
selected Oct 18, 2014 by dave
I've searched with that setting both on and off and both show the code :s
Is it a regular DOCX file? Does it also happen to other DOCX files?
It seems to be happening with docx and xlsx but not with doc or xls.
Do you have either Microsoft Office 2010 (or higher) installed, or the Microsoft Office IFilters installed:
http://www.microsoft.com/download/en/details.aspx?id=17062
Yes, that fixed it. Installing the IFilters seemed to turn it back to normal. Thanks for your help!!