Our client partners with system integrators (SIs) to drive sales of its products and services. The SIs engage with potential business customers to identify upselling and cross-selling opportunities. SIs are paid incentives based on revenue generated through audit engagements.
To receive incentives, SIs must deliver valid proof of engagement (POE) documents (email, Word, or PDFs). Currently, verifying POE documents is a time-consuming manual process. Approval varies depending on who reads the documents. In some cases, reviewer bias leads to incorrect incentive payments.
We used AI techniques to classify POE documents as valid or invalid and restrict incentive payments for invalid documents.
We used Text Classification and Feature Extraction algorithms to classify POE documents as valid or invalid.
We started by parsing POE documents into a standard consumable format. We then preprocessed the parsed data to remove redundant and stop words. Next, we applied feature extraction techniques (N-grams TF and Unigrams TF-IDF) to convert variable length text into equal length feature vectors. The vectors represent keywords and their frequency in the text.
We trained the output from the two extraction techniques using a Two-Class Logistic regression model. We then finalized the best model, based on the optimum KPI values, and deployed it as a web service that is used by the POE application.
Using the Text Classification and Feature Extraction algorithms, our client was able to identify POE documents as valid or invalid, reducing labor and avoiding invalid incentive payouts.