Document attributes provide ways to find and navigate among documents of interest. In fact, one of the biggest challenges in e-discovery and content management is to assign the best classifications and attributes to documents in a collection. Here is a checklist of where to look for different types of attributes: Within the Files Themselves, Either […]

Read More

The Data-Information-Knowledge-Wisdom (“DIKW”) model is a useful for examining how well an organization is doing in deriving value from its unstructured content. In his book, Too Big to Know,* David Weinberger credits Russell Ackoff, a leading organizational theorist, with making a pyramid-shaped depiction of the DIKW model in a 1988 address to the International Society for […]

Read More

In Everything is Miscellaneous, David Weinberger points out that no single classification system will necessarily best serve all those who use the classified content, and he points out several tools used by popular websites to let individual users create and share what they consider to be significant information. Many of those tools could be applied to improve the […]

Read More

Sometimes a large percentage of files found in unstructured content locations like file shares and ECM systems were actually created by database-driven business systems. These documents are essentially filled-in templates populated with specified database elements.  Whether stored as PDF or TIF, these computer-generated files are completely redundant to information stored in the database and could […]

Read More

Imagine the internet with great search functionality but no hyperlinks. You could locate any individual page or at least have it included in extensive search results, but then you’d have to conduct other searches to find related pages, even on the same website. Not very useful, right? The point is that text search functionality alone is […]

Read More

One of the biggest challenges in information governance is determining exactly how to manage unstructured content sitting on file shares and ECM systems. Here are some goals to consider when deciding how to deal with such unstructured content: 1. Raise Awareness. The system should provide managers with an awareness of the type and number of unstructured […]

Read More

Most file auto-classification systems rely on the presence of accurate textual representations of the files being classified. Organizations that use those auto-classification systems need to be aware of several problems with a text-reliant approach: Ignoring Non-Textual Files. Many files have no text associated with them, e.g., files output as PDF or TIF files from user-software or captured as […]

Read More

Using Unstructured Content to Update & Audit Process Management Systems The Need Data elements entered in enterprise process management systems are often taken from and evidenced by supporting documents. For example, in home mortgage loan tracking systems, names of borrowers may initially be taken from original loan applications. As more documents are generated during the loan […]

Read More
The BeyondRecognition Network

the-beyondrecognition-network-of-companies