Many organizations manage their content using enterprise content management (“ECM”) systems like SharePoint, Office 365, FileNet, Open Text, or Documentum. These ECM systems are essentially databases that let organizations associate various classifications, tags, and descriptions (collectively, “attributes”) with the content being managed and use those attributes to find specific documents. Parties seeking documents […]

The usual approach to classifying files or documents in an enterprise collection of unstructured content is top-down: determine what the classifications should be and then write rules or scripts on how to place individual files in the predetermined classifications. This presupposes a comprehensive knowledge of what’s in a collection and what attributes can be used […]

Document images often have quality issues that make it difficult to extract text or data elements from them. For example: Forms can have lines running through much of the text. Watermarks can interfere with text recognition. Text orientation may be skewed. Once specific issues have been identified, advanced image enhancement techniques can greatly improve the quality and quantity […]

Often it’s the hidden assumptions, the ones that are never made explicit or periodically confirmed, that cause the most damage when they’re wrong. In Information Governance and Enterprise Content Management one of the most basic assumptions is that there is one logical document per file with electronic records, and one logical unit […]

ECM users sometimes have the Info Gov equivalent of the deep-sea diving “bends.” This condition occurs when divers ascend too quickly after deep dives and the gas that had been compressed in their bodies decompresses, causing painful and sometimes fatal injuries. ECM users sometimes feel like their heads are about to explode after trying […]

There has been ongoing debate in information governance and e-discovery circles on the significance of documents that do not contain searchable text, with evidence that half or more of the documents in some collections cannot be analyzed or managed because the tools used for those purposes require textual representations. How important is this limitation in […]

Calculating MTV Ratio and True Recall: Many tools designed to search or classify documents as part of the enterprise content management and electronic discovery functions in organizations depend on having accurate textual representations of the documents being analyzed or indexed. They have text-tunnel vision – they cannot “see” non-textual objects. If the only documents of […]

Without the right tools, even basic information governance tasks can be difficult. The most glaring example is document classification, which is the bedrock upon which virtually all information governance initiatives rest. If you can’t accurately classify an ever-increasing volume of documents and correspondence, you can’t apply the correct retention schedules, you can’t specify which […]

A recent AIIM survey report, “Information Governance, records, risks and retention in the litigation age,” highlights issues faced by organizations in trying to manage their documents: custodian-based classification doesn’t work; disc storage is steadily growing with no end in sight; nobody ever seems to delete any electronic records; organizations want to unify their treatment […]
