top of page

How to Extract Data from Any Document using Bitskout

Updated: Sep 10, 2023


In this guide, we will learn how to use Bitskout's new feature to extract data from any document. This feature allows you to easily extract information from different types of forms by providing examples.


Video Instruction






Step 1.

Log in to your Bitskout account and click Create Plugin.


ree

Step 2.

The next step is to click on the Extract button and then choose "From File":


ree


Step 3.

Now we need to load an example. Bitskout needs examples from you to learn what you'd like to extract. We will use SEC filling reports as an example.

ree


Step 4.

Add examples to guide the data extraction process. Once you've loaded the file, write the fields that you'd like to extract. You need to give the field a name and the value should come from the loaded examples. See below:

ree



Step 5

You can add more examples to improve the accuracy of the data extraction. Let's add another form with a different layout.

ree

Once the file is loaded, you'll need to add the required values from this example:


ree


Step 6

Once you're done with examples, press Create and the plugin will be created. Next, depending on what you'd like to do, choose a tool where you want to use the plugin.

ree


Step 7.

Verify the accuracy of the data extraction by testing it on various forms. For example, try using an "Apple 10-Q form":

ree

Conclusion

Now your plugin is ready to be used. This way you can extract data from the documents with just a few examples.


We recommend adding 2-3 examples that have varying layouts. This way Bitskout will understand the variety of the documents that it'll have to analyze the information.


You can load documents in any language - the most important part is to add clear values from that example to extract.





Comments


bottom of page