...

Info

If I read this correctly, our patent is built on the premise that we beat the F1 scores of the (then) leading LLMs.

The patent mentions an F1 score of 74.38%.

We urgently need to validate the premise that our approach is significantly superior. LLMs have advanced quickly, and this benchmark may no longer be valid.

Attached are some examples of MWE matching with F1 scores of 80% and higher, using generally available tools.

For example:

https://aclanthology.org/2021.emnlp-main.112.pdf

...

https://arxiv.org/pdf/2303.06623.pdf

...

Additionally, there is this one:

https://www.researchgate.net/publication/369912067_Interpretable_Unified_Language_Checking

...

Many LLMs are currently benchmarked on reasoning and knowledge as well:

...