I'd like to recognize selection-marks (yes/no, [x]/[ ]) with the form-recognizer. I'm using the labeling tool and wondering if it's possible and if so how? The third layer of the labeling tool is named "Selection Marks", so this may be something which is in the works.
Selection marks / checkboxes are now available in Form Recognizer API 2.1: https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeWithCustomForm
Form Recognizer does not yet support checkboxes and selection marks. This feature is coming soon, so stay tuned.
Neta - MSFT
The forms I want to use in form recognizer have several checkboxes. Currently, form recognizer's OCR engine doesn't seem to detect checkboxes. Is there a work-around for this, or is there a way to train checkbox recognition?
I've also tried using tesseract on windows and linux, but I couldn't solve it there either.
The checkboxes are your standard open squares. Sometimes they may have an "x" or checkmark on them.
If there is no solution, then I may ask the customer to change from checkboxes to filled in circles (radio-buttons). I haven't tested that yet, to see what OCR would make of them.
Edit: I read the Form Recognizer documentation and saw that it explicitly says checkboxes and radio buttons are not supported. Does anyone have a workaround?
An OCR is by definition made for "character recognition" (see here). A checkbox or a radio button is not a character, so it will not be recognized by an OCR.
You can still try a Custom Vision detector to find those items, but it will be a bit complicated: that service will only locate the items in your document, so you will have to combine it with an OCR call to get the text, then match the zones in the document to work out which text is associated with which checkbox or radio button.
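The zone-matching step could look roughly like the sketch below. It assumes both the detector and the OCR call have already been reduced to plain `(left, top, width, height)` boxes; the function names and the nearest-centre rule are illustrative assumptions, not part of either service.

```python
from math import hypot

# A box is (left, top, width, height) in page coordinates (assumed format).
Box = tuple[float, float, float, float]


def center(box: Box) -> tuple[float, float]:
    """Return the centre point of a box."""
    left, top, width, height = box
    return (left + width / 2, top + height / 2)


def match_marks_to_text(
    marks: list[Box], lines: list[tuple[str, Box]]
) -> list[tuple[Box, str]]:
    """Pair each detected checkbox with the OCR line whose centre is closest."""
    if not lines:
        return []
    matches = []
    for mark in marks:
        mx, my = center(mark)
        nearest_text, _ = min(
            lines,
            key=lambda line: hypot(center(line[1])[0] - mx, center(line[1])[1] - my),
        )
        matches.append((mark, nearest_text))
    return matches


# Hypothetical detections: two checkboxes, each with a label to its right.
marks = [(10, 10, 10, 10), (10, 40, 10, 10)]
lines = [("Yes", (30, 10, 40, 10)), ("No", (30, 40, 30, 10))]
labels = [text for _, text in match_marks_to_text(marks, lines)]
```

Nearest-centre matching is the simplest possible rule; a real form may need a stricter one, such as only considering text on the same row and to the right of the box.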
Support for checkboxes was added to Form Recognizer in version 2.1 (in public preview as of September 2020). From the announcement:
Checkbox / Selection Mark detection – Form Recognizer supports
detection and extraction of selection marks such as check boxes and
radio buttons. Selection Marks are extracted in Layout and you can now
also label and train in Train Custom Model - Train with Labels to
extract key value pairs for selection marks.
There is now a selectionMarks object in the Get Analyze Layout Result API response that lists detected selection marks and their state, either selected or unselected.
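As a rough sketch of reading that object, the snippet below walks an already-downloaded Layout result and collects each mark's state. The `analyzeResult` / `readResults` / `selectionMarks` nesting and the `state`, `confidence` and `boundingBox` fields reflect my understanding of the v2.1 response; the sample payload is hand-written for illustration and is not real service output.

```python
from typing import Any


def list_selection_marks(result: dict[str, Any]) -> list[dict[str, Any]]:
    """Collect every selection mark with its page number, state and confidence."""
    marks = []
    analyze_result = result.get("analyzeResult", {})
    for page in analyze_result.get("readResults", []):
        for mark in page.get("selectionMarks", []):
            marks.append(
                {
                    "page": page.get("page"),
                    "state": mark.get("state"),
                    "confidence": mark.get("confidence"),
                    "bounding_box": mark.get("boundingBox"),
                }
            )
    return marks


# Hand-written sample payload (assumed shape, not captured from the service).
sample_response = {
    "status": "succeeded",
    "analyzeResult": {
        "readResults": [
            {
                "page": 1,
                "selectionMarks": [
                    {"boundingBox": [1, 1, 2, 1, 2, 2, 1, 2],
                     "confidence": 0.98, "state": "selected"},
                    {"boundingBox": [1, 3, 2, 3, 2, 4, 1, 4],
                     "confidence": 0.95, "state": "unselected"},
                ],
            }
        ]
    },
}

selected = [m for m in list_selection_marks(sample_response) if m["state"] == "selected"]
```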
Support for labeling checkboxes and selection marks was also added to the sample labeling tool as of version 2.1.
In Form Recognizer if the forms have a consistent layout, you might be able to tag the area using the new GUI tool and pass that specific area to OCR to try to improve results.
The method @Ram-msft suggested can work, but I find that the recognizer struggles to consistently pick out any single character in a box. To be honest, though, checkbox-type boxes seem to work better than, say, a number in a box (in my experience at least).
As long as you're interested in any "value" inside the box (i.e. it's not empty) then that method should give you reasonable results until they come up with a true solution.
What is the best approximation algorithm to implement full-text fuzzy search? For example, we have a dropdown list with the following data (from a SQL data source):
Company Policy
Product Catalog
Our Partners
Now I want to replace it with an autocomplete textbox, such that when the letter "p" is typed the list shows all three results. It should match the first letter of the first word, or of the second word, and so on. It should also highlight the matched letters, or make them bold, in the suggestions dropdown.
Is there a ready-made control for ASP.NET (with JS or jQuery) that delivers all of this functionality? Otherwise, if I have to implement it myself, is there a tutorial or blog post that could point me in the right direction?
I believe this is what you're looking for.
It's jQuery UI, and it has the autocomplete functionality described.
Check out this one. I used it and it works very well:
http://www.asp.net/ajaxLibrary/AjaxControlToolkitSampleSite/AutoComplete/AutoComplete.aspx
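Independently of either control, the matching rule described in the question (match at the start of any word, then bold the matched letters) can be sketched in a few lines. This is plain word-prefix matching rather than true fuzzy search, and the `suggest` function and `<b>` markup are illustrative assumptions.

```python
import re


def suggest(query: str, items: list[str]) -> list[str]:
    """Return items in which some word starts with `query`, with matches in <b> tags."""
    if not query:
        return []
    # \b anchors the match to the start of a word; the match is case-insensitive.
    pattern = re.compile(r"\b" + re.escape(query), re.IGNORECASE)
    results = []
    for item in items:
        if pattern.search(item):
            results.append(pattern.sub(lambda m: f"<b>{m.group(0)}</b>", item))
    return results


items = ["Company Policy", "Product Catalog", "Our Partners"]
suggestions = suggest("p", items)
```

Note that the "p" inside "Company" is not highlighted, because it does not begin a word. Item text should be HTML-escaped before being rendered in a real page.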
I'm just starting to get my hands dirty with Silverlight, and the term "value converter" keeps popping up. I don't remember reading about them in the ASP.NET web app world. Is this something specific to Silverlight/WPF? What is its purpose?
Thanks!
From this excellent post:
When you’re binding data to controls there will be times when the data needs to be modified or tweaked some on the way into a control or as the data leaves a control and goes back to the source property (during a TwoWay binding for example). Sure, you can always write code to change a given value, but in many cases it’s much easier to write a simple value converter instead that can be re-used. In this post I’ll walk through creating a value converter and then show the code for a few of the value converters I find myself using fairly frequently.
There is a stark contrast between the web platform and Windows when it comes to data binding, especially in WPF / Silverlight / Windows Phone 7. These technologies support data binding differently from the web, which is stateless.
A very common example of a value converter is when you want to hide or show a control in WPF / Silverlight. Controls like StackPanel have a Visibility property, which is an enum. We can assign values like Visible / Collapsed / Hidden to show or hide the stack panel. In most cases the visibility is controlled by a boolean value, so you use a converter to convert the boolean to a Visibility value.
Another example of a value converter is formatting amount fields. Say the value 1000 is stored in the database and you want to display it as $1,000.00 in an amount text box. You can use a value converter to do so.
The possibilities are endless. You can think of a value converter as producing a visual representation of something. Another example I can think of is the completion progress of a task: you can show a nice colourful progress bar instead of showing values like 10%, 20%, 30% completed :)
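To make the idea concrete without any XAML, here is a language-neutral sketch in Python of the two examples above. The classes are hypothetical stand-ins that mirror the Convert / ConvertBack pair of the real IValueConverter interface; they are not the WPF/Silverlight API itself.

```python
from decimal import Decimal
from enum import Enum


class Visibility(Enum):
    """Stand-in for the UI framework's visibility enum."""
    VISIBLE = "Visible"
    COLLAPSED = "Collapsed"


class BooleanToVisibilityConverter:
    """Source (bool) -> target (Visibility), and back again."""

    def convert(self, value: bool) -> Visibility:
        return Visibility.VISIBLE if value else Visibility.COLLAPSED

    def convert_back(self, value: Visibility) -> bool:
        return value is Visibility.VISIBLE


class CurrencyConverter:
    """Source (Decimal amount) -> target (display text), and back again."""

    def convert(self, value: Decimal) -> str:
        return f"${value:,.2f}"

    def convert_back(self, text: str) -> Decimal:
        return Decimal(text.replace("$", "").replace(",", ""))


shown = BooleanToVisibilityConverter().convert(True)
amount_text = CurrencyConverter().convert(Decimal(1000))
```

The binding engine calls the first method on the way into the control and the second on the way back to the source property during a two-way binding.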
Hope this helps.
I am developing a flex application for collaborative data analysis. To present the data my application uses standard and custom components (grids, charts etc.).
I want to provide a feature that lets users make notes over the GUI of my application, so that other users can see their notes later on.
At the moment my question is: how can I implement a mechanism that allows making notes over the GUI? All suggestions and examples are welcome.
There are a lot of ways to approach this. (Check out Buzzword, MS Word, and Acrobat, which each take a slightly different approach to note-taking on a document. I assume an application GUI could use any of the same approaches.)
I'd start by saying that the click event bubbles:
http://livedocs.adobe.com/flex/3/langref/flash/display/InteractiveObject.html#event:click
So, listen for the click event on every child of your main application file. When you receive that click event, you can apply some business logic to decide whether or not to add a comment/note on the component that was clicked. Then you just need some "note" component for collecting and displaying the note data. You can position the notes based on the x, y values of the click event.
So, actually my problem is much easier than I expected (thanks to the great design of Flex).
I decided to utilize PopUpManager functionality for my task. It does everything I need at the moment.
I am trying to figure out the best way to do spell checking in a Flex application.
I need support for multiple languages, such as English, Russian and so on. Also, I don't need real-time spell checking, only at particular moments (for example, when the user clicks a "check my text" button). I found a client-side solution here - flex:SpellCheck - but it looks like it can only check English text. So now I am thinking of a server-side solution that receives the text and the required language, checks it (for example with aspell) and returns the result. What do you think: is this a good approach, or am I missing something?
Thanks.
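The server-side idea described above could be sketched roughly as follows. This assumes the `aspell` binary and the relevant language dictionaries are installed on the server, and that its pipe mode (`aspell -a`) emits ispell-style lines where `&` marks a misspelling with suggestions and `#` one without; the function names are hypothetical.

```python
import subprocess


def parse_aspell_output(output: str) -> dict[str, list[str]]:
    """Map each misspelled word to its suggestions from ispell-style pipe output."""
    misspelled: dict[str, list[str]] = {}
    for line in output.splitlines():
        if line.startswith("&"):
            # Format: "& word count offset: suggestion1, suggestion2, ..."
            head, _, suggestions = line.partition(":")
            word = head.split()[1]
            misspelled[word] = [s.strip() for s in suggestions.split(",")]
        elif line.startswith("#"):
            # Format: "# word offset" (misspelled, no suggestions)
            misspelled[line.split()[1]] = []
    return misspelled


def check_text(text: str, lang: str) -> dict[str, list[str]]:
    """Run aspell in pipe mode for the given language code (e.g. "en", "ru")."""
    # A leading "^" tells pipe mode to treat the line as text, not a command.
    escaped = "\n".join("^" + line for line in text.splitlines())
    proc = subprocess.run(
        ["aspell", "-a", f"--lang={lang}", "--encoding=utf-8"],
        input=escaped,
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_aspell_output(proc.stdout)


# Hand-written sample of pipe-mode output, used to exercise the parser only.
sample_output = (
    "@(#) International Ispell Version 3.1.20 (but really Aspell 0.60.8)\n"
    "& helo 3 0: hello, help, hell\n"
    "*\n"
    "# xqzt 5\n"
)
parsed = parse_aspell_output(sample_output)
```

The Flex client would then only need to post the text and a language code to an endpoint wrapping `check_text` and render the returned words.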
You can check out the Squiggly project on Adobe Labs - link
Quoting the page:
Squiggly is a
spell checking engine for Adobe®
Flash® Player and Adobe AIR®. The
Squiggly library allows you to easily
add spell checking functionality in
any Adobe Flex® 3 or Flex 4 based text
control. The distribution package
consists of a utility for building
your own spelling dictionaries, a
sample English dictionary [...]