Greg Walters Ai
  • Green Screen
  • CricketUS
  • Bio
  • The Last Sales Trainer
  • back page
    • Writers >
      • Celeste Dame
    • Sources and Methods

Ai Prediction #9, An Ai with Human Speech capabilities.

5/16/2024

0 Comments

 
By Greg Walters

​Looking back, this is an easy one. Speech & Language

Another prediction confirmed.  The world will be talking to their data any day now.

​Yesterday, OpenAi announced Chat GPT o.  That's an 'o' not a zero. 'O' for Omni, hence the attached image and is a reference the multi-modal capabilities; speech, vision, etc.

This next level boasts a 'multi-modal' existence.  It simply means that the LLM can interact with the real world through mechanical view(camera), understand language(phone mic) and communicate verbally(speaker).

The demonstrations are pretty good.  The LLM comes across  conversational - it laughed and repeatably handled human interruptions.
"The new voice (and video) mode is the best computer interface I've ever used. It feels like AI from the movies," said OpenAI CEO Sam Altman in a blog post.
For now, voice recognition is 'magical' but soon, very soon, talking with your LLM will be so common, we'll forget we ever used the QWERTY,
​
With voice control, more end users and knowledge workers are accessing and making their own GPTs. Like training a new employee or assistant.  One who never needs directions repeated, but does require clear, distinct instructions - at first.  ​I think we are going to see departments and companies filled with people, automate their work. ​

Today, the LLM understands your speech and multiple languages.

​Here are some of the possibilities - today.
​
  1. Virtual Assistants: Siri, Google Assistant, and Alexa are popular examples. They help with tasks like setting reminders, answering questions, and controlling smart home devices.
  2. Customer Service: AI chatbots and voice assistants handle customer inquiries, provide support, and perform tasks like booking appointments and processing orders.
  3. Healthcare: Speech-enabled AI assists in patient documentation, transcribing doctor-patient interactions, and offering voice-activated interfaces for medical devices.
  4. Accessibility: These applications aid individuals with disabilities, such as voice-to-text for those with hearing impairments or voice-controlled devices for those with mobility issues.
  5. Language Translation: Tools like Google Translate use speech recognition to offer real-time translation, making communication easier across different languages.
  6. Education: AI-powered tools support language learning and reading comprehension, offering pronunciation guides and interactive learning experiences.
  7. Automotive: Voice-activated systems in cars allow drivers to control navigation, make calls, and manage entertainment without taking their hands off the wheel.
  8. Smart Homes: Speech-enabled AI controls lights, thermostats, security systems, and other smart devices, creating a more integrated and convenient living environment.
  9. Content Creation: Tools like Otter.ai transcribe meetings, interviews, and lectures, making it easier to generate written content from spoken words.
  10. Gaming: Voice commands in gaming enhance player interaction and provide more immersive experiences.

Tomorrow?
0 Comments

Your comment will be posted after it is approved.


Leave a Reply.


    Authors

    Greg Walters
    Charlie G. Peterson, IV
    Gabriella Paige Trenton
    Grayson Patrick Trent
    Gideon P. Tailor
    Jax T. Halloway

    Robert G. Jordan
    Dr. Jeremy Stone
    ​Grayson P. Trent


    View my profile on LinkedIn

    Archives

    December 2024
    November 2024
    September 2024
    August 2024
    July 2024
    June 2024
    May 2024
    April 2024
    March 2024
    February 2024
    January 2024
    December 2023
    November 2023
    October 2023
    September 2023
    August 2023
    July 2023

Greg Walters, Inc.

Who is Greg
Masthead
History
Who We've Worked with
​Email Us
​Disclaimer: This content is in part for educational purposes. I unequivocally denounce any form of violence, hate, harassment, or bullying. This page does not endorse or promote dangerous acts, organizations, or any forms of violence. In accordance with the 107 of the Copyright Act of 1976, this content is made available for "fair use" purposes, including criticism, comment, news reporting, teaching, scholarship, education, and research.
Greg Walters, Inc. Copyright 2030
  • Green Screen
  • CricketUS
  • Bio
  • The Last Sales Trainer
  • back page
    • Writers >
      • Celeste Dame
    • Sources and Methods