Stability AI’s audio generator can now crank out 3 minute ‘songs’

Stability AI just unveiled Stable Audio 2.0, an upgraded version of its music-generation platform. This system lets users create up to three minutes of audio via text prompt. That’s around the length of an actual song, so it'll also whip up an intro, a full chord progression and an outro.

First, the good news. Three minutes is huge. The previous version of the software maxed out at 90 seconds. Just imagine the fake birthday song you could make in the style of that one Rob Thomas/Santana track. Another boon? The tool is free and publicly available through the company’s website, so have at it.

It primarily works via text prompt, but there’s an option to upload an audio clip. The system will analyze the clip and produce something similar. All uploaded audio must be copyright-free, so this isn’t for the purposes of mimicking something that already exists. Rather, it could be useful for, say, humming a drum part or extending a 20-second clip into something longer.

Now, the bad news. This is still AI-generated music. It’s cool as a conversation piece and as an emblem of a possible future that’s great for tinkerers and bad for musicians, but that’s about it. The songs can actually sound nifty, at first, until the seams start showing. Then things get a bit creepy.

For instance, the system loves adding vocals, but not in any known human language. I guess it’s in whatever language makes up the text in AI-generated images. Sometimes the vocals sort of sound like actual people, and other times they sound like Gregorian chanters filtered through outer space. It’s right smack dab in the middle of that uncanny valley. The Verge called the vocals “soulless and weird,” comparing them to whale sounds. That tracks.

Stable Audio 2.0 makes the same weird little mistakes that all of these systems make, no matter the output type. Parts can vanish into thin air, replaced with something else. Sometimes melodic elements will double out of nowhere, like an audio version of those extra fingers in AI-generated images.

There’s also the, well, boring-ness of it all. This is music in name only. Without a human connection, what’s the point? I listen to music to get inside the head of another person or group of people. There’s no head to get inside of here, despite constant proclamations that artificial general intelligence (AGI) is only months away.

So, this tech is an absolute gift for those making silly birthday videos or bank hold music. For everyone else? Shrug. One thing I can say from personal experience: It’s pretty fast. The system concocted an absolutely terrifying big band song about my cat in around a minute. 

This article originally appeared on Engadget at https://www.engadget.com/stability-ais-audio-generator-can-now-crank-out-3-minute-songs-160620135.html?src=rss

Amazon just walked out on its self-checkout technology

Amazon is removing Just Walk Out tech from all of its Fresh grocery stores in the US, as reported by The Information. The self-checkout system relies on a host of cameras, sensors and good old-fashioned human eyeballs to track what people leave the store with, charging the customers accordingly.

The technology has been plagued by issues from the outset. Most notably, Just Walk Out merely presents the illusion of automation, with Amazon crowing about generative AI and the like. Here’s where the smoke and mirrors come in. While the stores have no actual cashiers, there are reportedly over 1,000 real people in India scanning the camera feeds to ensure accurate checkouts.

It’s also incredibly expensive to install and maintain the necessary equipment, which is likely why Just Walk Out technology was only adopted at around half of Fresh stores in the US. There have been plenty of frustrating issues for consumers when using this system, from receipts being sent out hours after purchase to completely mismanaged orders. In other words, it took a vast array of sensitive equipment and 1,000 people staring at video feeds to do the job of one or two people sitting behind cash registers at each store. Ain’t modern innovation grand? To be fair, Amazon reached out to Engadget to say that the tech "has continued to scale while reducing the number of human reviews year-over-year." 

There are also some major privacy concerns here. Remember those cameras and sensors? They can be used to collect biometric information as people shop. This goes beyond Amazon’s palm-scanning tech, as the cameras and sensors measure the shape and size of each customer’s body for identification and tracking purposes. This led to a class action suit in New York that accused the company's Amazon One technology of collecting biometric identifier information without properly disclosing the practices to consumers.

The suit says that Amazon ran afoul of the state’s Biometric Identifier Information Law, which requires businesses to tell customers if they are collecting data used for identification purposes. Peter Romer-Friedman, an attorney representing the plaintiffs, told The Seattle Times that “Amazon owes its customers an explanation about how it’s operating these systems before people enter — so that people can decide for themselves whether they want to provide measurements of the size and shape of their body as a condition of getting a sandwich.” The company says that Just Walk Out, however, doesn't rely on the same biometric identifiers. 

Amazon tried to sell the technology to other retail chains, but didn’t get too many bites. It teamed up with Starbucks in a few locations and there was a small launch in hospitals for medical staff, but that’s about it. One sticking point? These systems require high ceilings to accommodate the cameras and sensors. Reuters also suggested that many retailers consider Amazon a competitor and disruptor, souring them on a technology partnership. Those 1,000 off-shore cashiers probably didn’t help with the sales pitch either.

Just Walk Out technology will continue to be offered in many stores in the UK. As for the US, Amazon says the removal of these systems is part of a larger effort to revamp its retail grocery arm. The company plans on bringing its Dash smart carts to retail locations, after a test at several Whole Foods and Fresh stores. These smart carts are equipped with scales and sensors to track spending in real time and, of course, allow consumers to skip the checkout.

Update, April 3, 2024, 2:10 PM ET: This story has been updated to include information provided by an Amazon rep regarding the specifics of the Just Walk Out technology. 

This article originally appeared on Engadget at https://www.engadget.com/amazon-just-walked-out-on-its-self-checkout-technology-191703603.html?src=rss

Dave the Diver joins the PS Plus catalog on April 16

Dave the Diver is joining the PlayStation Plus catalog on April 16. If you’ve been on the fence about the ocean-faring adventure/restaurant sim, this is a good chance to check it out without spending any extra money, assuming your PS Plus membership hasn’t lapsed.

For those living under a coral reef, Dave the Diver is a wickedly addictive game that wears many hats. The gameplay splits into two primary components. During the day, you explore an ever-changing ocean, with fish to hunt, sharks to fight and mysteries to solve. The deeper you go, the weirder things get.

Once night falls, the action shifts to a sushi restaurant. You hire the staff, plan the menu and serve the guests. This is one part management sim and one part arcade game, with a hectic pace that recalls the coin-op classic Tapper.

The two gameplay mechanics shouldn’t mesh well, being so wildly different, but somehow they do. It’s like, uh, ocean-exploring peanut butter and sushi-making jelly. Dave the Diver is also surprisingly funny, with a large cast of oddballs both over and under the sea. Let me put it this way. You can hire an off-brand Jason Voorhees, a velociraptor and a ninja to be your waiters and sous chefs. There’s a reason why it made our list of the best games of 2023.

PS5 players are getting some slight improvements to suit the console, including haptic feedback that makes use of the adaptive triggers of the DualSense controllers. There’s also Godzilla-based DLC coming in May, which promises “even more enormous threats lurking in the depths.” The game’s already available for the Nintendo Switch and PC, though it remains absent from the Xbox catalog.

In addition to Dave the Diver, PS Plus members will soon be getting another treat. Sony just announced that the action-adventure title Tales of Kenzera: Zau will be a day one exclusive to PlayStation Plus on April 23.

This article originally appeared on Engadget at https://www.engadget.com/dave-the-diver-joins-the-ps-plus-catalog-on-april-16-154532307.html?src=rss

OpenAI says it can clone a voice from just 15 seconds of audio

OpenAI just announced that it recently conducted a small-scale preview of a new tool called Voice Engine. This is a voice cloning technology that can mimic any speaker by analyzing a 15-second audio sample. The company says it generates “natural-sounding speech” with “emotive and realistic voices.”

The technology is based on the company’s pre-existing text-to-speech API and it has been in the works since 2022. OpenAI has already been using a version of the toolset to power the preset voices available in the current text-to-speech API and the Read Aloud feature. There are a bunch of samples on the company’s official blog and they sound eerily close to the real thing. I encourage you to give them a listen and imagine the possibilities, both good and bad.

OpenAI says it sees this technology being useful for reading assistance, language translation and helping those who suffer from sudden or degenerative speech conditions. The company brought up a Brown University pilot program that helped a patient with speech impairment issues by creating a Voice Engine clone pulled from audio recorded for a school project.

Despite the potential benefits, bad actors would certainly abuse this technology to engage in some serious deepfake tomfoolery, which is already a problem. With this in mind, Voice Engine isn’t quite ready for prime time, as there are serious privacy concerns that must be addressed before a full rollout.

OpenAI acknowledges that this tech has “serious risks, which are especially top of mind in an election year.” The company says it’s incorporating feedback from “US and international partners from across government, media, entertainment, education, civil society and beyond” to ensure the product launches with a minimal amount of risk. All preview testers agreed to OpenAI’s usage policies, which ban the impersonation of another individual without consent or legal right.

Additionally, anybody using the tech will have to disclose to their audience that the voices are AI-generated. OpenAI implemented safety measures, like watermarking to trace the origin of any audio and “proactive monitoring” of how the system is being used. When the product officially rolls out there will be a “no-go voice list” that detects and prevents AI-generated speakers that are too similar to prominent figures.

As for when that rollout will occur, OpenAI remains tight-lipped. TechCrunch uncovered some potential pricing data and it looks like it will undercut competitors in the space like ElevenLabs. Voice Engine could cost $15 per one million characters, which works out to around 162,500 words. This is about the length of Stephen King’s The Shining. It certainly sounds like a budget-friendly way to get an audiobook done. The marketing materials also make reference to an “HD” version that costs twice as much, but the company hasn’t detailed how that will work.

OpenAI has been making big moves this week. It just announced another partnership with its bestie Microsoft to build an AI-based supercomputer called “Stargate.” The project will reportedly cost a whopping $100 billion, according to The Information.

This article originally appeared on Engadget at https://www.engadget.com/openai-says-it-can-clone-a-voice-from-just-15-seconds-of-audio-190356431.html?src=rss

You can now use your phone to get started with Amazon’s palm-reading tech

Amazon just launched an app that lets people sign up for its palm recognition service without having to head to an in-store kiosk. The Amazon One app uses a smartphone’s camera to take a photo of a palm print to set up an account. Once signed up, you can pay for stuff by using just your hand, ending the tyranny of having to carry a smartphone, cash or a burdensome plastic card.

The tech uses generative AI to analyze a palm's vein structure, turning the data into a “unique numerical, vector representation” which is recognized by scanning machines at retail locations. You’ll have to add a payment method within the app to get started and upload a photo of your ID for the purpose of age verification.

The app launches today for iOS and Android. Previously, you’d have to go to a physical location to sign up for Amazon One. Beyond payments, the tech is also used as an age verification tool and as a way to enter concerts and sporting events without having to bring along a ticket.

Once you hand over your palm-print to the completely benevolent Amazon corporation, you’ll have unfettered access to each and every Whole Foods grocery store throughout the country. Amazon, after all, owns Whole Foods. Amazon One payments are also accepted at some Panera Bread locations, in addition to certain airports, stadiums and convenience stores.

There are obvious privacy concerns here, as passwords can change but palms cannot. Amazon says that all uploaded palm images are “encrypted and sent to a secure Amazon One domain” in the Amazon Web Service cloud. The company also says the app “includes additional layers of spoof detection,” noting that it’s not possible to save or download palm images to the phone itself.

This article originally appeared on Engadget at https://www.engadget.com/you-can-now-use-your-phone-to-get-started-with-amazons-palm-reading-tech-184814302.html?src=rss

Vizio just announced a $999 86-inch 4K TV

It used to be that you’d have to sell a kidney to afford a giant 4K TV for the living room. That is no longer true, as television prices continue to decline. Case in point? Vizio just announced a new 86-inch 4K smart TV that costs just $999.

The first thing worth mentioning about the latest member of Vizio’s lineup of 4K televisions is its size. The next-biggest model is 75 inches and that’s already large enough to absolutely take over most spaces. This thing will be like having a Times Square billboard in your living room. That's not a bad thing, particularly for home theater buffs. 

We don’t know how this set will look in action yet, but it does offer a serious batch of features. Vizio says it “boasts the same powerful picture quality as its predecessors,” thanks to the inclusion of Dolby Vision HDR and HDR10+. The company also promises the TV can run games at 120 fps once you switch to 1080p. Here’s hoping the product can make good on this claim. Other features include dual-band Wi-Fi 6 connectivity and DTS:X audio.

Walmart agreed to buy Vizio in February for $2.3 billion, though the deal still faces regulatory approval. The 86-inch 4K TV officially goes on sale April 29 at both brick-and-mortar and digital retailers.

This article originally appeared on Engadget at https://www.engadget.com/vizio-just-announced-a-999-86-inch-4k-tv-160030764.html?src=rss