Question about fine-tuning a model to add characters #1943
docTR v0.10.0 detects all the text as needed, but it has trouble recognizing some currency symbols such as "₹". To address this, I created a synthetic dataset in which the words combine letters, digits, currency symbols, and other valid characters. After I trained the recognition model on it, the model failed to recognize even the text it had recognized correctly before.
Hi @Vishnu280412 👋, could you please share the command you used to fine-tune the model? :) In such cases I would suggest freezing at least the backbone with
Best regards,
For fine-tuning, you should generate a more diverse dataset (start with 200 or more images). Additionally, I can't recommend the linked repo; SynthTiger works much better. I have modified the code a bit (https://github.com/felixdittrich92/synthtiger/tree/doctr-modified, not really clean yet), and you can configure it to your own needs.
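One way to check how diverse a synthetic dataset is before training is to count how often each vocabulary character appears in the labels. The following is only a minimal sketch in plain Python: the helper name `char_coverage`, the vocabulary string, and the labels are all hypothetical and are not part of docTR or SynthTiger.

```python
from collections import Counter


def char_coverage(labels, vocab):
    """Count character occurrences and report coverage gaps.

    Returns (counts, missing, unknown):
      counts  - Counter of every character seen in the labels
      missing - vocab characters that never appear in any label
      unknown - label characters that are not part of the vocab
    """
    counts = Counter(ch for label in labels for ch in label)
    missing = sorted(ch for ch in vocab if counts[ch] == 0)
    unknown = sorted(ch for ch in counts if ch not in vocab)
    return counts, missing, unknown


# Hypothetical vocab and labels, for illustration only
vocab = "0123456789abc₹$"
labels = ["₹120", "abc$9", "₹₹45"]

counts, missing, unknown = char_coverage(labels, vocab)
print("least frequent:", counts.most_common()[-3:])
print("never seen:", missing)
print("outside vocab:", unknown)
```

If the new symbol dominates the counts while many previously supported characters are rare or missing, the dataset is probably too narrow for fine-tuning.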
Hey, glad to hear 👍
Yes, every model is limited to a fixed character length: most models to 29/30 chars plus possible EOS / SOS / PAD tokens (32 overall), except the master architecture, which is limited to 47 chars + EOS / SOS / PAD (50). Additionally, each crop is resized to 32x128, so there is no space to fit more chars, but under the hood we use a split & merge logic for larger crops.
Two days ago we pushed an improvement for this logic into the main branch with #1939, so I would suggest pulling the latest changes from the main branch and testing again :)
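To illustrate the general split & merge idea for wide crops, here is a simplified sketch in plain Python. It is not docTR's actual implementation: the function names, the 50% overlap, and the naive string merge are assumptions made for illustration. Only the 4:1 target ratio is taken from the 32x128 input size mentioned above.

```python
def split_windows(width, height, target_ratio=4.0, overlap=0.5):
    """Return (start, end) pixel ranges that cover a crop horizontally.

    target_ratio=4.0 mirrors a 32x128 model input (128 / 32).
    The overlap fraction is an assumed value, not a docTR default.
    """
    win = max(1, int(round(height * target_ratio)))  # window width in pixels
    if width <= win:
        return [(0, width)]  # crop already fits, no split needed
    step = max(1, int(win * (1 - overlap)))
    starts = list(range(0, width - win, step))
    starts.append(width - win)  # last window is flush with the right edge
    return [(s, s + win) for s in starts]


def merge_strings(a, b):
    """Naive merge: drop the longest suffix of `a` that is also a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return a + b[k:]
    return a + b


# A 32x320 crop is too wide for one 32x128 window, so it is split
print(split_windows(320, 32))
# Predictions from overlapping windows are stitched back together
print(merge_strings("₹1,250", "250.00"))
```

A merge based on exact string overlap breaks down when the overlapping windows disagree on a character, so a real implementation needs something more tolerant than this.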