💻 Frameworks & Toolkits

  • KeLP – Kernel-based Learning Platform for scalable ML.
  • GAN-BERT – Few-shot adversarial learning on Transformers.
  • GAN-BERT-PyTorch – PyTorch/HF port of GAN-BERT.
  • MT-GANBERT – Multi-task + GAN-BERT for sustainable NLP.
  • EthicalNNEthics by Design framework in PyTorch.
  • GrUT – Semantic parsing for Human-Robot Interaction.
  • LU4R – Adaptive spoken language understanding for robots.
  • ACLPUB2 – ACL proceedings generation tool.
  • BacKGen – Background Knowledge Generator.
  • dats – Data augmentation for NLP.

📚 Datasets & Benchmarks

  • ExtremITA – Instruction-tuned LLM for Italian.
  • U-DepPLLaMA – Universal dependency parsing with LLMs.
  • MM-IGLU, MM-IGLU-IT & MM-IGLU-Dialogues – Multimodal grounded understanding benchmarks.
  • FEVER-it – Italian fact-checking dataset & pipeline.
  • HuRIC – Human-Robot Interaction Corpus 2.0.
  • GQA-it – 1M+ Visual Question Answering pairs in Italian.
  • mscoco-it – 600K captions for Italian Image Captioning.
  • msr-vtt-it – 200K Italian video caption pairs.
  • SQuAD-it – 60K Q/A triples for reading comprehension.
  • ABSITA – Tourism opinion mining dataset.
  • SENTIPOLC – 10K annotated Italian tweets.

🎓 Teaching Material