AI oversight


Tools for monitoring the risks and impact of AI.

Publications

  1. J. Contro, S. Deol, Y. He, and M. Brandao, “ChatbotManip: A Dataset to Facilitate Evaluation and Oversight of Manipulative Chatbot Behaviour,” in TrustNLP: Sixth Workshop on Trustworthy Natural Language Processing, 2026.
  2. S. Deol, J. Contro, and M. Brandao, “Is this Chatbot Trying to Sell Something? Towards Oversight of Chatbot Sales Tactics,” in Proceedings of the 9th Widening NLP Workshop, 2025, pp. 136–156.