Marcely Zanon Boito

research

∙ 09/11/2023

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

Self-supervised learning (SSL) is at the origin of unprecedented improve...

0 Titouan Parcollet, et al. ∙

research

∙ 06/13/2023

NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track

This paper presents NAVER LABS Europe's systems for Tamasheq-French and ...

0 Edward Gow-Smith, et al. ∙

research

∙ 05/04/2022

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks

This paper describes the ON-TRAC Consortium translation systems develope...

2 Marcely Zanon Boito, et al. ∙

research

∙ 04/04/2022

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems

Self-supervised models for speech processing emerged recently as popular...

0 Marcely Zanon Boito, et al. ∙

research

∙ 01/13/2022

Speech Resources in the Tamasheq Language

In this paper we present two datasets for Tamasheq, a developing languag...

9 Marcely Zanon Boito, et al. ∙

research

∙ 06/08/2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings

When documenting oral-languages, Unsupervised Word Segmentation (UWS) fr...

0 Marcely Zanon Boito, et al. ∙

research

∙ 04/23/2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

Self-Supervised Learning (SSL) using huge unlabeled data has been succes...

0 Solene Evain, et al. ∙

research

∙ 03/30/2020

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation

For endangered languages, data collection campaigns have to accommodate ...

0 Marcely Zanon Boito, et al. ∙

research

∙ 10/30/2019

ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task

This paper describes the ON-TRAC Consortium translation systems develope...

0 Ha Nguyen, et al. ∙

research

∙ 10/11/2019

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages

For language documentation initiatives, transcription is an expensive re...

0 Marcely Zanon Boito, et al. ∙

research

∙ 07/30/2019

MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible

The CMU Wilderness Multilingual Speech Dataset is a newly published mult...

0 Marcely Zanon Boito, et al. ∙

research

∙ 06/29/2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings

Since Bahdanau et al. [1] first introduced attention for neural machine ...

0 Marcely Zanon Boito, et al. ∙

research

∙ 07/27/2018

A small Griko-Italian speech translation corpus

This paper presents an extension to a very low-resource parallel corpus ...

0 Marcely Zanon Boito, et al. ∙

research

∙ 06/18/2018

Unsupervised Word Segmentation from Speech with Attention

We present a first attempt to perform attentional word segmentation dire...

0 Pierre Godard, et al. ∙

research

∙ 09/17/2017

Unwritten Languages Demand Attention Too! Word Discovery with Encoder-Decoder Models

Word discovery is the task of extracting words from unsegmented text. In...

0 Marcely Zanon Boito, et al. ∙

Marcely Zanon Boito

Featured Co-authors

Sign in with Google

Consider DeepAI Pro