Hacker News new | ask | show | jobs
by tuardoui 1123 days ago
You should look at LayoutLM models for a NER task. Then your pipeline should look like : - Identity the menu sub structure (title, item list ...) - Classify each item with 2 labels.

The training process is not hard, but the data gathering / cleaning / labelling can be a little long.

1 comments

Thanks! I haven't heard of LayoutLM but something that can understand structure from a few examples could be just what I need.