Glyphs can be used to consistently deconstruct, classify, and attribute large volumes of files permitting effective management of them. Deconstruction breaks files into their smallest visual elements, classification uses data visualization at the page level, and attribution selects specified glyphs or their text values from within classifications. Deconstruction The word “glyph” has several meanings. In […]

Read More

Enterprise Content Management systems enable organizations to work effectively with their unstructured content. ECM typically takes a more holistic view of an organization’s documents than e-discovery and enhances the ability to retrieve and analyze documents beyond what e-discovery is typically able to achieve. ECM classifies unstructured content, provides controlled access to it, and assigns granular […]

Read More

Selection bias occurs when data are selected for analysis in a way that not all objects being evaluated are equally likely to be selected. This results in samples that are not representative of entire populations. An extreme example would be predicting the presidential race by only sampling New York City or Los Angeles, or predicting all […]

Read More

Implicit biases – those that we form and use without explicit consideration – can wreak havoc on achieving critical goals. One such type of bias is especially damaging when designing file classification systems – confirmation bias. That is the “…tendency to search for, interpret, favor, and recall information in a way that confirms one’s preexisting […]

Read More

Imagine the internet with great search functionality but no hyperlinks. You could locate any individual page or at least have it included in extensive search results, but then you’d have to conduct other searches to find related pages, even on the same website. Not very useful, right? The point is that text search functionality alone is […]

Read More

Positional word frequency involves identifying how many times individual words appear at the same relative locations within the pages or documents in a collection. Positional word frequency solves major problems that occur when performing three basic functions involving unstructured content: Classification Attribute Extraction/Coding Search Without positional word frequency, low-value words can cause clutter in text […]

Read More

Using Unstructured Content to Update & Audit Process Management Systems The Need Data elements entered in enterprise process management systems are often taken from and evidenced by supporting documents. For example, in home mortgage loan tracking systems, names of borrowers may initially be taken from original loan applications. As more documents are generated during the loan […]

Read More

The three most important criteria by which to judge file or document classification and coding systems are Consistency Consistency & Consistency The reason is pretty obvious: without consistency a file classification scheme cannot deliver any of the promised downstream benefits, things like enhanced retrievability, selection of appropriate retention schedules, and setting appropriate security access permissions […]

Read More
The BeyondRecognition Network

the-beyondrecognition-network-of-companies