📄️ Introduction
This project was started as part of an intra-company (Syngenta) Gen-AI hackathon, in a team of 4 members.
📄️ Data
This tool is meant to handle the retention workflow of all the documents in the company.
📄️ Preprocessing
The first step is to extract all the text from a document and preprocess it into a format suitable to be analyzed.
📄️ Classification of documents
This is the core of the project - classifying the document into a subcategory.
📄️ Expiry
Once we have the subcategory of a document, computing the expiry is trivial.
📄️ Dashboard
We made a dashboard that shows all the current documents in a table, sorted in order of expiry date (nearest expiry date first), with the documents that have already expired being flagged red and user being notified.
📄️ Future
We have only scratched the surface. A lot more can be done for this.