Classification of documents
This is the core of the project - classifying the document into a subcategory.
This is the core of the project - classifying the document into a subcategory.
We made a dashboard that shows all the current documents in a table, sorted in order of expiry date (nearest expiry date first), with the documents that have already expired being flagged red and user being notified.
This tool is meant to handle the retention workflow of all the documents in the company.
Once we have the subcategory of a document, computing the expiry is trivial.
We have only scratched the surface. A lot more can be done for this.
This project was started as part of an intra-company (Syngenta) Gen-AI hackathon, in a team of 4 members.
The first step is to extract all the text from a document and preprocess it into a format suitable to be analyzed.