Guide to Robustness Gym: Unifying the NLP Evaluation Landscape -

Once the AI/ML model is built, researchers spend a considerable amount of time to come up with different parameters on which that model should be evaluated. Evaluation methods are problem-specific. Recently, Stanford University along with Salesforce Research and UNC-Chapel Hill has proposed a system for the evaluation of NLP pipelines, commonly referred to as Robustness Gym.

#nlp #machine-learning

analyticsindiamag.com

Guide to Robustness Gym: Unifying the NLP Evaluation Landscape -