Sunday, February 5, 2023

Two computer scientists explain how to recognize a deepfake

Shreya Christina (https://londonbusinessblog.com)

We did the opposite. By reversing many of these same techniques, we were able to get an approximation of a speaker’s vocal tract during a speech segment. This allowed us to effectively look into the anatomy of the speaker who created the audio sample.

From here, we hypothesized that deepfake audio samples would not be constrained by the same anatomical limitations that humans have. In other words, we expected the analysis of deepfake audio samples to yield vocal tract shapes that do not occur in humans.

Our test results not only confirmed our hypothesis, but also revealed something interesting. When extracting vocal tract estimates from deepfake audio, we found that the estimates were often comically incorrect. For example, it was common for deepfake audio to result in vocal tracts with the same relative diameter and consistency as a drinking straw, as opposed to human vocal tracts, which are much wider and more variable in shape.

This finding shows that deepfake audio, even when convincing to human listeners, is far from indistinguishable from human-generated speech. By estimating the anatomy responsible for creating the observed speech, it is possible to identify whether the audio was generated by a person or a computer.
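The article does not spell out how the vocal tract estimates were computed, so the following is only a rough sketch of the general idea, not the authors' method. It assumes a textbook stand-in: linear prediction (the Levinson-Durbin recursion) turns a speech frame's autocorrelation into reflection coefficients, and a lossless-tube model turns those into relative cross-sectional areas. The function names, the sign convention for the area ratio, and the synthetic demo signal are all assumptions made for illustration.

```python
import math
import random


def autocorrelation(samples, max_lag):
    """Biased autocorrelation of one audio frame for lags 0..max_lag."""
    n = len(samples)
    return [
        sum(samples[t] * samples[t + lag] for t in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def reflection_coefficients(r, order):
    """Levinson-Durbin recursion: autocorrelation -> reflection coefficients."""
    a = [1.0] + [0.0] * order   # prediction-error filter coefficients
    err = r[0]                  # prediction error energy (must be > 0)
    ks = []
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        ks.append(k)
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return ks


def tube_areas(ks, first_area=1.0):
    """Relative areas of a lossless-tube model, one section per coefficient.

    Assumed convention: A[i+1] / A[i] = (1 - k) / (1 + k). Other texts use
    the opposite sign, which reverses which end of the tube is which.
    """
    areas = [first_area]
    for k in ks:
        areas.append(areas[-1] * (1.0 - k) / (1.0 + k))
    return areas


def area_spread(areas):
    """Ratio of widest to narrowest section; near 1 means a uniform tube."""
    return max(areas) / min(areas)


if __name__ == "__main__":
    # Synthetic stand-in for a speech frame: a noisy decaying resonance.
    rng = random.Random(0)
    frame = [
        math.exp(-t / 200.0) * math.sin(0.3 * t) + 0.05 * rng.gauss(0.0, 1.0)
        for t in range(400)
    ]
    ks = reflection_coefficients(autocorrelation(frame, 8), 8)
    areas = tube_areas(ks)
    print("relative areas:", [round(a, 3) for a in areas])
    print("spread:", round(area_spread(areas), 3))
```

A detector built on this idea would compare such estimates against ranges measured from real speakers. Any thresholds would have to be calibrated on real data, and none are suggested here.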

Why this is important

Today’s world is defined by the digital exchange of media and information. Everything from news to entertainment to conversations with loved ones usually happens through digital exchanges. Even in their infancy, deepfake video and audio undermine people’s trust in these exchanges, effectively limiting their usefulness.

If the digital world is to remain a critical source of information in people’s lives, effective and secure techniques for determining the source of an audio clip are essential.


Logan Blue is a Ph.D. student in computer and information science and engineering at the University of Florida. Patrick Traynor is a professor of computer and information science and engineering at the University of Florida.

This article was republished from The Conversation under a Creative Commons license. Read the original article.
