Posts by Tags

internship

My secondment at Oxford Wave Research

4 minute read

Published:

I spent two months, as part of my Marie Curie PhD program, at Oxford Wave Research, where I could learn about forensic speech technology from world-leading experts.

interspeech

A glimpse at Interspeech 2019 papers

2 minute read

Published:

Interspeech is drawing closer day by day, so I went through the planned schedule to cherry-pick the titles that I found a bit more interesting than the others (from the perspective of pathological speech synthesis). I would also like to use this opportunity to go through the authors and show some impressive results.

machine-learning

My reproducibility practices for ML

2 minute read

Published:

I’m currently working on making some of my code reproducible (meaning: producing exactly the same results) on different platforms, and some of the lessons learned in the past few days hit so hard that I thought it would be worth sharing them with other people.

pathological-speech

Transcribing 3 hours of pathological speech

4 minute read

Published:

It has been a month since my last blog post, but here we are again. Recently, I started a collaboration in Delft so that we can discover even more aspects of pathological speech. For that, I had to transcribe three hours of oral cancer speech, and I’ve learned many lessons that I think are interesting for other people and might be widely applicable.

singing-synthesis

Fiddling with AI singing: the Mellotron

2 minute read

Published:

It was one of my mini PhD goals to train a full-fledged Tacotron 2 model, but I lacked the project goal and resources to do it. So my winter “vacation” came, meaning I could finally fiddle with the Mellotron model.

speech-synthesis

Fiddling with papers: MOSNet

4 minute read

Published:

One of the recurring themes in my PhD is the topic of naturalness of signals, in particular the naturalness of speech. It is challenging to define what we mean by naturalness – in fact, this is the problem itself. Though we could find a definition for it in “natural language”, that wouldn’t give us a mathematical formula for it.

Fiddling with AI singing: the Mellotron

2 minute read

Published:

It was one of my mini PhD goals to train a full-fledged Tacotron 2 model, but I lacked the project goal and resources to do it. So my winter “vacation” came, meaning I could finally fiddle with the Mellotron model.

training-event

Toulouse: the third TAPAS training event

2 minute read

Published:

Before Interspeech, we had a quick opportunity to showcase our results so far, and talk about data management and research ethics.