Today’s Post by: Roger Welch, Senior consultant, IBM Analytics
IBM Datacap has never been more exciting than it is right now. There’s a palpable anticipation with new tool sets that is leading SME’s to discover all the new possibilities. The advancements of text layout analyzation brought to us by the Insight Edition coupled with our integration to natural language processing technologies such as SystemT allow Datacap to cognitively understand unstructured content like never before. These abilities are in their infancy, and our clients are desperately looking for us to mature them as quickly as possible. This is what drives the anticipation at times. But everyone sees the potential. And everyone wants in.
Let’s back up a little bit and understand the basics before we blow the roof off content capture and extraction. For decades everyone understood the mantra of capture. Ingest, Classify, Extract, Validate, Verify, Re-Validate, Export. Classification was the link between processing and exception handling. Figure out the classification first and then we will know what to extract. Name the process before applying the process. Well, you can say goodbye to that.
The future of content capture and extraction is looking to we skip the classification and go straight to the extraction of key pairs. Any business process is keenly aware of their content or key pairs. For example, it takes 6 pieces of information to move money from 1 bank account to another bank account. As the pieces of content start to match into key pairs of a business process, the system begins to create a validity score. I don’t need to know the piece of paper is a telegraphic transfer before looking for the content. If I find the 6 pieces of content then I know exactly what I’m supposed to do. Classification is an afterthought.
Take for example receipts. They are everywhere. Ingest one or one hundred into the new Datacap Insight edition and we’ll tell you what day it’s from, where you went, and how much you spent. Or if you are a developer and really want to explore what’s possible, integrate to other dictionaries that will also tell you if you over spent your per diem rate or if it met with your care and wellness plan.
I no longer care where you’ve hidden the content – correspondence, table structures, line items, or traditional forms. Give us a global list of key pairs that process lines of business and we’ll start processing business. That’s cognitive capture with Datacap. Learn more through Cognitive Capture video.