• Nigerian/African Tech
  • Start Up
  • Internet
    • App
    • Mobile
    • Software
  • Gadgets
  • Money
  • Video
Tech News, Magazine & Review WordPress Theme 2017
  • Home
  • Africa
  • Business
  • Video
  • Metaverse
  • AI
  • Gadgets
  • Earnings
  • Tips
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
  • Home
  • Africa
  • Business
  • Video
  • Metaverse
  • AI
  • Gadgets
  • Earnings
  • Tips
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Home Cloud

Skype’s Real-Time Translator Learns How to Speak From Social Media

Femi by Femi
August 26, 2014
Share on FacebookShare on Twitter

Think you have trouble deciphering social media slang? Try translating it. Microsoft researchers have been studying how to translate social media, and in their efforts they came across a way to teach the company’s upcoming Skype Translator how to speak more like us.

Some researchers think social media could be key to getting computers to better understand humans. Social media experiments are “important examples of a new line of research in computational social science, showing that subtle social meaning can be automatically extracted from speech and text in a complex natural task,” says Dan Jurafsky, an expert in computational linguistics at Stanford, who recently led work on teaching computers about human interactions by listening to speed dating.

Also On TechBooky

Google Commits To Complying With EU Laws On Its Services

Here’s How ChatGPT Can Help Improve Your SEO

Here Are Key Differences Between Google Search And ChatGPT

How Machine Learning Can Be Used To Fight Government Corruption

OpenAI Begins Pilot Of ChatGPT Premium Version

The Skype Translator app, set for beta release later this year, translates multilingual conversations over the service as they’re happening. In May, Gurdeep Singh Pall, corporate vice president of Skype and Lync at Microsoft, and a German-speaking colleague demoed the app at the Code Conference, in Rancho Palos Verdes, Calif. As Pall spoke in English, both German and English subtitles scrolled along the bottom of the screen while real-time audio translation accompanied the subtitles.

The software system is a synthesis of several technologies, including speech recognition, machine translation, and speech synthesis. But Vikram Dendi, technical and strategy advisor at Microsoft Research, in Redmond, Wash., says past attempts to simply daisy-chain the technologies were unsuccessful because developers had failed to consider the drastic difference between the way we speak and the way we write.

For starters, real speech is peppered with vocalized “ums” and “ahs,” awkward pauses, varying intonations, and vocal stresses, which are all absent in text. Consider what would happen if a speech translation system misinterpreted the subtle difference between these two statements:

“You’re picking up the kids?”

And “You’re picking up the kids!”

Suffice it to say, grumpy offspring would be the end product.

The gap exists between translating text and translating speech because some of the best machine translation systems today are taught using large volumes of high-quality text, which does not include the awkwardness that speech recognition systems deal with. So Microsoft Research set about searching for techniques to help close that gap. Among them was a software system the company developed to translate social media musings.

Before turning to social media, Microsoft’s translation system extracted text from published books and Web sources that had been translated from one language to another. The data was then fed into a machine-learning pipeline that Microsoft calls phrasal statistical machine translation (phrasal SMT). The system chops up the text into a collection of small phrases called an n-gram, where n denotes the number of phrases. If the system is trying to translate, say, English to German, then the n-gram from a text in English is mapped to the n-gram of the equivalent text in German. This process teaches the computer what each phrase translates to.

Once it has learned its fill from the n-gram alignment, the software is ready to encounter new, untranslated text. When the machine is asked to translate a new phrase in English, the algorithm calculates the probability that the new English segment of text maps to one of the phrases it knows in German. The system then spits out the most probable translation.

Phrasal SMT excels at memorizing and matching data. For common phrases it can translate that exact phrase across several languages, and even if the words in the phrase are slightly reordered, it still works. But if the words in an uncommon phrase are reordered, the system gets confused. Some of the confusion arises because SMT doesn’t really understand grammar and so can’t shift from the rules of one language to those of another. For example, an English sentence usually runs subject, verb, object. But the same sentence in Japanese would be subject, object, verb.

This is why the Microsoft Research team pioneered a system known as syntactically

informed phrasal statistical machine translation (syntactic SMT). It builds on the phrasal SMT foundation but also understands syntax. Instead of just matching common phrases, syntactic SMT breaks up a phrase into individual words and then maps each word over to the other language.

Cutting up phrases and connecting individual words may sound like a primitive approach, but it’s not. “That’s pretty much the best method,” says Chris Manning, professor of linguistics and computer science at Stanford. “Microsoft’s machine translation team has been one of the prominent developers in this area, and basically, that is the state of the art in machine translation at the moment.”

Syntactic SMT was a big step, but there was room for improvement, particularly in the fast-growing universe of social media. The Microsoft Research team began studying communications on Facebook, by Short Message Service (SMS), and on Twitter to figure out the best way to manage conversational text.

But that came with a new set of problems. Each social media platform has its own distinct characteristics—Facebook posts incorporate more emotional expressions, SMS users type shorter messages, and tweets are something in between. So researchers had to first develop a social media text normalization system, software that could automatically adapt to these variations in style to produce something that syntactic SMT can process. Adding the normalizer system to the translator’s training protocol helped increase the accuracy of social-media text translation by 6 percent, according to Microsoft’s Dendi. “That significantly improved the quality,” he says. “Of course, there’s still a lot of work to do, but when we did this, it really did move the needle on understanding and translating that type of data better.” What’s more, the techniques developed to improve social media translation are very similar to what was needed to bridge the gap between speech recognition and translation.

Skype Translator isn’t the only speech translation system on the scene, though. According to Macduff Hughes, engineering director of Google Translate, many people use his company’s software to test their own ability to speak a foreign language. He also says that in the past year, Google has added new features on its mobile apps that allow people to use Translate in more scenarios. But the system doesn’t yet translate in real time and is not integrated into a video telephony application, which means multilingual speakers need to be in the same location and speak into the same app.

Google might be one of the only other companies with a shot at making a comparable system. Dendi says Microsoft’s Skype work required deep knowledge of the company’s Bing Web index to build the translation system, and a company would need similar assets to build another. “That’s why there are only a few places in the world that can build a system of this kind and scale that can serve millions and millions of customers in this fashion across a range of scenarios,” Dendi says.

source: Teresa Chong/ IEEE Spectrum

Related Posts:

  • Microsoft Tried To Buy Pinterest, Talks Are No Longer Holding For Now
    Microsoft Tried To Buy Pinterest, Talks Are No Longer…
  • Filings Show That Elon Musk Has Acquired A 9.2% Stake In Twitter
    Filings Show That Elon Musk Has Acquired A 9.2% Stake In…
  • Elon Musk Admits He Is Giving ‘Serious Thought’ To Build A New Social Media Platform
    Elon Musk Admits He Is Giving ‘Serious Thought’ To Build A…
  • Apple Will Allow Parler Back In The App Store Following Suspension
    Apple Will Allow Parler Back In The App Store Following…
  • Social Media Strategies That Are Guaranteed To Drive Traffic
    Social Media Strategies That Are Guaranteed To Drive Traffic
  • New Social Media Platform "Lips" Promises Sex Educators And Content Creators A New Experience
    New Social Media Platform "Lips" Promises Sex Educators And…
  • LinkedIn Rebrands Its Social Media With New Features Including Pronouns Update Including Pronouns
    LinkedIn Rebrands Its Social Media With New Features…
  • 7 Tech Skills That Every Working Student Needs
    7 Tech Skills That Every Working Student Needs
Femi

Femi

Paul Balo is a wireless communications technologist with interests in VoIP and 5G technologies. He leads the writing team at TechBooky

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

Loading

Recent

Tesla Cybertruck Mass Production Won’t Start Until 2024

Tesla Cybertruck Mass Production Won’t Start Until 2024

January 27, 2023
Apple Reportedly Delays Development Of Its Own WiFi Chips

Apple Reportedly Delays Development Of Its Own WiFi Chips

January 27, 2023
Google Commits To Complying With EU Laws On Its Services

Google Commits To Complying With EU Laws On Its Services

January 27, 2023
Airtel Launches Its eSIM Technology In Nigeria

Airtel Launches Its eSIM Technology In Nigeria

January 27, 2023
In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

January 27, 2023
How And How Not Gaming Can Be Used In Solving Real Problems

How And How Not Gaming Can Be Used In Solving Real Problems

January 27, 2023
Tesla Sues Former Employee For Allegedly Stealing Trade Secrets

Tesla Made The Most Money In 2022, But Its Future Still Rocky

January 26, 2023
Shutterstock Introduces Its Generative AI Image Tool

Shutterstock Introduces Its Generative AI Image Tool

January 26, 2023
Meta Agrees To $725M Settlement Of Cambridge Analytica Lawsuit

Meta Set To Reinstate Trump’s Facebook And Instagram Accounts

January 26, 2023
Here’s How ChatGPT Can Help Improve Your SEO

Here’s How ChatGPT Can Help Improve Your SEO

January 25, 2023

Browse Archives

January 2023
MTWTFSS
 1
2345678
9101112131415
16171819202122
23242526272829
3031 
« Dec    

About Us

TechBooky

TechBooky is a social Tech blog with a special focus on the budding African Technology sector. TechBooky is currently based in Abuja, Nigeria.

Subscribe to TechBooky

Enter your email address to subscribe to TechBooky and receive notifications of new posts by email.

Join 24 other subscribers.

Receive top tech news directly in your inbox

Loading

Popular Tags

AI (252) amazon (95) android (281) app (610) Apple (473) artificial intelligence (265) business (338) china (113) cloud (135) cryptocurrency (158) ecommerce (109) enterprise (239) facebook (472) gadget (448) gaming (160) google (545) government (381) guest post (108) instagram (137) internet (352) ios (249) iphone (210) microsoft (261) mobile (281) new feature (287) nigeria (276) privacy (135) research (134) samsung (139) security (374) smartphone (235) social media (671) software (415) startup (268) streaming (140) telecom (157) tips (340) transport (104) twitter (216) united states (191) users (132) videos (115) website (159) whatsapp (129) youtube (106)

Quick Links

  • Home
  • Africa
  • Business
  • Video
  • Metaverse
  • AI
  • Gadgets
  • Earnings
  • Tips

Popular Post

  • Trending
  • Comments
  • Latest
Download Free Editable Resume Templates – Word / Docx – 2022

Download Free Editable Resume Templates – Word / Docx – 2022

July 25, 2022
The Best Free PC Games

The Best Free PC Games

July 29, 2022
Recover Permanently Deleted Emails From iCloud Manually

Recover Permanently Deleted Emails From iCloud Manually

March 5, 2022
Resume and Cover letter Templates for free

Resume and Cover letter Templates for free

July 25, 2022
How is Technology Changing Our Definition of What It Means to Be a Human?

How is Technology Changing Our Definition of What It Means to Be a Human?

April 1, 2018
[Fixed] “Outlook Running Slow Windows 10” Issue

[Fixed] “Outlook Running Slow Windows 10” Issue

February 12, 2020
Tesla Cybertruck Mass Production Won’t Start Until 2024

Tesla Cybertruck Mass Production Won’t Start Until 2024

January 27, 2023
Apple Reportedly Delays Development Of Its Own WiFi Chips

Apple Reportedly Delays Development Of Its Own WiFi Chips

January 27, 2023
Google Commits To Complying With EU Laws On Its Services

Google Commits To Complying With EU Laws On Its Services

January 27, 2023
Airtel Launches Its eSIM Technology In Nigeria

Airtel Launches Its eSIM Technology In Nigeria

January 27, 2023
In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

January 27, 2023
How And How Not Gaming Can Be Used In Solving Real Problems

How And How Not Gaming Can Be Used In Solving Real Problems

January 27, 2023

Recent News

Tesla Cybertruck Mass Production Won’t Start Until 2024

Tesla Cybertruck Mass Production Won’t Start Until 2024

January 27, 2023
Apple Reportedly Delays Development Of Its Own WiFi Chips

Apple Reportedly Delays Development Of Its Own WiFi Chips

January 27, 2023
Google Commits To Complying With EU Laws On Its Services

Google Commits To Complying With EU Laws On Its Services

January 27, 2023
Airtel Launches Its eSIM Technology In Nigeria

Airtel Launches Its eSIM Technology In Nigeria

January 27, 2023
In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

In Spite Of The Sucess Of Genetically Modified Foods, Debates Abound

January 27, 2023
How And How Not Gaming Can Be Used In Solving Real Problems

How And How Not Gaming Can Be Used In Solving Real Problems

January 27, 2023
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact us
  • Privacy Policy
  • Disclaimer
  • Login

© 2021 Design By Tech Booky Elite

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
  • Home
  • Africa
  • Business
  • Video
  • Metaverse
  • AI
  • Gadgets
  • Earnings
  • Tips

© 2021 Design By Tech Booky Elite