- The Neuron
- Posts
- šø Google's new voice AI
šø Google's new voice AI
PLUS: The Neuron has a NEW website!
Welcome, humans.
ANNOUNCEMENT: We have a new website! Itās everything you love about The Neuron and more:
Go check it out and let us know what you think!
S/O to the awesome DemandFlow team for crafting this new site from start to finish.
Hereās what you need to know about AI today:
Google released Gemini Live, a new voice AI for Androidā¦
OpenRouter lets you use GPT-4's new 200-page output feature.
Sakana AI launched āThe AI Scientistā to automate research.
Job applications are increasingly using AI for cover letters / resumes.
Google has a new voice mode, and itās like what would happen if Apple Intelligence and Advanced Voice Mode had a baby.
Google just announced a ton of new AI stuff at its Made by Google 2024 event. The biggest news was probably Gemini Live, Googleās version of ChatGPTās Advanced Voice Mode (think AI that you can talk to).
Hereās what Gemini Live has got going on:
10 voices to choose from.
Hands-free operation, even when the phone is locked.
Pausable conversations, with the ability to interrupt it mid-sentence.
If you subscribe to Google One AI Premium, you can use it right now with the Gemini mobile app on Android (iOS coming soon).
The early reviews are in:
Itās faster than Googleās OG voice assistant, but the botās a little too chatty sometimes.
āBetter than talking to Siriā, but still hallucinates.
Conversationally, itās a step forward; functionally, itās not there yet.
The live demo was definitely live, with not one, but two on-stage tech fails. And this demo of the app in the wild shows you definitely need to cut it off to get a word in.
For Android owners: Gemini Live will also have ācontext-awareā capabilities. This will work similarly to Apple Intelligence, where Gemini can interact with other Google apps through whatās called Gemini extensions.
Here are a few that will launch over the next few weeks:
Keep: āGemini, jot down this recipe in my Keep notes.ā
Tasks: āAdd ābuy birthday giftā to my Google Tasks list for next week.ā
Calendar: āAm I free this Friday afternoon? Schedule a dentist appointment if so.ā
Youāll even be able to ask Gemini to perform quick calculations, convert units, and other utility functions without switching to a separate app.
Hereās why this matters: Some think Gemini Live might have an advantage over Apple Intelligence because of Googleās deep knowledge of the real-world.
That might also be why OpenAI is developing SearchGPT: once our AI can interface directly with real-world data, in real time, the functionality of any AI tool gets a lot more powerfulā¦ as long as they stop hallucinating fake playgrounds (looking at you, Gemini).
FROM OUR PARTNERS
šLearn how to unlock GenAI's power without risking your business.
āāProtecting your data shouldn't mean missing the GenAI revolutionābut while you're grappling with compliance concerns, your competitors are launching super-valuable GenAI apps.
Ready to do both?
The team at OctoAI has gathered a panel of industry experts to share insider tips and help you innovate securely and efficiently.
Sign up to join the free round table session on August 27th, and youāll learn how to navigate:
Data security.
Private deployment.
Regulatory compliance.
Quality evaluations.
ā¦And so much more!
Sign up now to secure your place. Can't make it live? No worriesāwhen you register, you still get the recording. Register here.
Around the Horn.
OpenRouter will let you use GPT-4oās new āextended outputā, where ChatGPT can now produce 200-page long outputs. Try it here.
Sakana AI released āThe AI Scientistā, an AI dedicated to automating scientific research and āopen-ended discoveryāāitās already started publishing papers.
Recruiters estimate that up to 50% of job applications now use AI to create cover letters and resumes, but the big 4 accounting firms are warning applicants not to.
Treats To Try.
ElevenLabs clones voices and generates AI voices for text-to-speech, using just one minute of audio (and today they released ElevenStudios, a fully managed AI dubbing service for videos and podcasts).
ChatwithPDF analyzes PDF documents and answers questions about their content in seconds.
Wondercraft generates complete podcast episodes from your initial text, including scripts, voiceovers, and music.
Chatbase creates custom chatbots from your content, which can be embedded on websites or accessed via API.
Metaview automates interview transcription, summarization, and provides feedback on interviewer performance.
*We have select secondary ad spots available in August. Get your brand in front of 450,000+ professionals here.
What gives it away that this is AI?
Every Monday/Wednesday, weāll post a new AI video, and your job is to tell us what about the video gives it away as being AI.
Ready? One watch, everyone knows the rules:
Hereās what #Neuron readers had to say about last Wednesdayās video:
M.U.: āItās the hands. Always the hands.ā
A.R.: āMovement of the natural elements (fire, water, burning) gives the appearance of each but does not follow laws of physics that one would expect from an experienced animatorā
Y.B.: āThe bad soundtrackā š
Where do you #Neuron?!
Submit where you #Neuron here for a chance to be featured in our newsletter next week!
Oliver, at the Olympics in the Grand Palais: Catching up with The Neuron before the competition starts.
Hayley from London, UK: Come on England!
Gladys from Atlanta, Georgia: Catching up on the Neuron Newsletter while chilling at the Georgia Department of Driver Services, waiting for my son to get his Learner's permit! šš
Joe from Kapaāa Kauai: Artificial intelligence combined with natural beauty. The perfect mix!
A Cat's Commentary.
Thatās all for today, for more AI treats, check out our website. The best way to support us is by checking out our sponsorsātodayās is OctoAI. See you cool cats on Twitter: @nonmayorpete & @noahedelman02 |
|