Lisbon Unit for Learning and Intelligent Systems
We are pleased to announce the creation of the Lisbon Unit for Learning and Intelligent Systems (LUMLIS), a unit of the
European Laboratory for Learning and Intelligent Systems (ELLIS), hosted at the Instituto Superior Técnico (IST) of the University of Lisbon (UL).
The HLTmeet Reading Group @ INESC-ID
This reading group meets regularly to discuss research topics on different sub-fields of Speech and Natural Language Processing.Reading Group Schedule
Summer Term 2023-2024 Wednesdays at 4:00 PM (room 336 or Zoom ) | ||
Date | Presenter | Topic |
May 22 | Carlos Carvalho | Memorizing Transformers - [paper] |
May 15 | Francisco Teixeira Thomas Rolland |
ICASSP 2024 |
May 8 | Thomas Rolland | Parameter efficient finetuning |
April 17 | John Mendonça | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits - [paper] |
April 10 | Francisco Teixeira Thomas Rolland |
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features Improved Children's Automatic Speech Recognition Combining Adapters and Synthetic Data Augmentation Exploring Adapters with Conformers for Children's Automatic Speech Recognition |
April 3 | Isabel Trancoso | The Unreasonable Effectiveness of Eccentric Automatic Prompts - [paper] |
March 27 | Catarina Botelho | Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets - [paper] |
March 20 | Andreas Spanias, FIEEE, Professor and Center Director | Arizona State University | Quantum Machine Learning Simulations for Speech Processing Applications |
March 13 | Mariana Julião | CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training - [paper] |
March 6 | Anna Maria Pompili | Towards Designing a ChatGPT Conversational Companion for Elderly People - [paper] |
Winter Term 2023-2024 Wednesdays at 4:00 PM (room 336 or Zoom ) | ||
Date | Presenter | Topic |
Sep 13 | Catarina Botelho Mariana Julião |
Careful Whisper - leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification - [paper] The Androids Corpus: A New Publicly Available Benchmark for Speech Based Depression Detection - [paper] Towards robust paralinguistic assessment for real-world mobile health (mHealth) monitoring: an initial study of reverberation effects on speech - [paper] Which aspects of motor speech disorder are captured by Mel Frequency Cepstral Coefficients? Evidence from the change in STN-DBS conditions in Parkinson’s disease - [paper] Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously - [paper] Speech Self-Supervised Representation Benchmarking: Are We Doing it Right? - [paper] |
Sep 20 | Francisco Teixeira | Malafide: a novel adversarial convolutive noise attack against deepfake and spoofing detection systems - [paper] Vocoder drift in x-vector–based speaker anonymization - [paper] Mutual Information-based Embedding Decoupling for Generalizable Speaker Verification - [paper] pyannote.audio 2.1 speaker diarization pipeline: principle, benchmark, and recipe - [paper] |
Sep 27 | Carlos Carvalho Francisco Teixeira |
AfriNames: Most ASR Models "Butcher" African Names - [paper] MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets - [paper] Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data - [paper] |
Oct 4 | John Mendonça Gonçalo Raposo |
The Timing Bottleneck: Why Timing and Overlap Are Mission-Critical for Conversational User Interfaces, Speech Recognition and Dialogue Systems - [paper] ChatGPT vs. Crowdsourcing vs. Experts: Annotating Open-Domain Conversations With Speech Functions - [paper] Leveraging Large Language Models for Automated Dialogue Analysis - [paper] The Open-Domain Paradox for Chatbots: Common Ground as the Basis for Human-Like Dialogue - [paper] Approximating Online Human Evaluation of Social Chatbots With Prompting - [paper] Memories for Virtual AI Characters - [paper] |
Oct 18 | Carlos Carvalho | Transformers learn through gradual rank increase - [paper] |
Oct 25 | Gonçalo Raposo | Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution - [paper] |
Nov 8 | Mariana Julião Thomas Rolland |
Exploring the Utility of Automatically Generated References for Assessing L2 Prosody One Wide Feedforward is All You Need - [paper] |
Nov 15 | Francisco Teixeira | Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks - [paper] |
Nov 22 | John Mendonça | LLMaAA: Making Large Language Models as Active Annotators - [paper] |
Nov 29 | Rubén Solera-Ureña | Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting - [paper] |
Dec 6 | Annamaria Pompili | Large language models encode clinical knowledge - [paper] |
Dec 19 | Isabel Trancoso | Generative AI models should include detection mechanisms as a condition for public release - [paper] |
Jan 10 | John Mendonça | xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark - [paper] |
Jan 17 | Alberto Abad | Computing Education in the Era of Generative AI - [paper] |
Jan 24 | Carlos Carvalho | Evaluating context-invariance in unsupervised speech representations - [paper] |
Feb 7 | Rui Henriques | Workshop on Statistical Analysis - [paper] |
Feb 14 | Rubén Solera Ureña | Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text - [paper] |
Feb 21 | Francisco Teixeira | Towards Unbounded Machine Unlearning - [paper] |
Summer Term 2022-2023 Wednesdays at 4:00 PM (room 336 or Zoom ) | ||
Date | Presenter | Topic |
Feb 22 | Carlos Carvalho | Regeneration Learning: A Learning Paradigm for Data Generation - [paper] |
Mar 1 | Gonçalo Raposo | GPTScore: Evaluate as You Desire - [paper] |
Mar 15 | Fernando Batista | Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation - [paper] |
Mar 22 | Thomas Rolland | Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition - [paper] Towards Data Selection on TTS Data for Children’s Speech Recognition - [paper] |
Mar 29 | Francisco Teixeira | Encoder-Decoder Based Attractors for End-to-End Neural Diarization - [paper] |
April 5 | Higo Pires | Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach - [paper] |
April 12 | Catarina Botelho | Machine Love - [paper] |
April 19 | Rubén Solera-Ureña | Tips and tricks for researchers and reviewers |
April 26 | John Mendonça | Sparks of Artificial General Intelligence: Early experiments with GPT-4 - [paper] |
May 3 | Mariana Julião | Truth Is a Lie: Crowd Truth and the Seven Myths of Human Annotation - [paper] |
May 10 | Patrícia Pereira | Is ChatGPT Equipped with Emotional Dialogue Capabilities? - [paper] |
May 24 | Francisco Teixeira and Anna Havras |
ICASSP work and Master's thesis - [paper] |
May 31 | Isabel Trancoso | Interpreting Deep Representations of Phonetic Features via Neuro-Based Concept Detector: Application to Speech Disorders Due to Head and Neck Cancer - [paper] |
Jun 21 | Carlos Carvalho | Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding - [paper] |
Jun 28 | Rubén Solera-Ureña | Federated Learning for ASR based on Wav2vec 2.0 - [paper] |
Jul 5 | Gonçalo Raposo | DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines - [paper] |
Jul 19 | Alberto Abad | Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition - [paper] |
Jul 26 | Thomas Rolland | Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute - [paper] |
Winter Term 2022-2023 Wednesdays at 4:00 PM (room 336 or Zoom ) | ||
Date | Presenter | Topic |
Oct 12 | Carlos Carvalho | Robust Self-Supervised Audio-Visual Speech Recognition - [paper] |
Oct 19 | Discussion about ICASSP and Interspeech papers | |
Oct 26 | Gonçalo Raposo | Memorizing Transformers - [paper] |
Nov 2 | Francisco Teixeira | Introducing Model Inversion Attacks on Automatic Speaker Recognition - [paper] |
Nov 9 | Thomas Rolland | AudioLM: a Language Modeling Approach to Audio Generation - [paper] |
Nov 23 | Higo Pires | Evaluation of Sentiment Analysis in Finance: From Lexicons to Transformers - [paper] |
Nov 30 | Rubén Solera-Ureña | The Importance of Speech Stimuli for Pathologic Speech Classification - [paper] |
Jan 4 | Catarina Botelho | ChatGPT - [blog post] |
Jan 11 | John Mendonça | The Forward-Forward Algorithm: Some Preliminary Investigations - [paper] |
Jan 18 | Mariana Julião | On the Utility of Self-supervised Models for Prosody-related Tasks - [paper] |
Jan 25 | Alberto Abad | FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech - [paper] |
Feb 1 | Patrícia Pereira | Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue Generation - [paper] |
Feb 8 | Isabel Trancoso | On the Predictive Power of Objective Intelligibility Metrics for the Subjective Performance of Deep Complex Convolutional Recurrent Speech Enhancement Networks - [paper] |