“I don’t plan on opening the app again,” Lorenz told Wired. “I don’t want to support any network that doesn’t take user safety seriously.” Her experience wasn’t a one-off, and since then darker, racist elements have appeared. It seems the behavior that mars every other social platform also lurks beneath Clubhouse’s exclusive, cool veneer.

Gaming chat app Discord, meanwhile, has exploded in popularity. The service uses voice-over-IP software to translate spoken chat into text (an idea that came from video gamers who found typing while playing impossible). In June, to tap into people’s need for connection during the pandemic, Discord announced a new slogan—“Your place to talk”—and began trying to make the service appear less gamer-centric. The marketing push seems to have worked: by October, Discord estimated 6.7 million users—up from 1.4 million in February, just before the pandemic hit.

But while Discord’s communities, or “servers,” can be as small and innocent as kids organizing remote sleepovers, they have also included far-right extremists who used the service to organize the Charlottesville white supremacist rally and the recent insurrection at the US Capitol.

In both Discord and Clubhouse, the in-group culture—nerdy gamers in Discord’s case, overconfident venture capitalists for Clubhouse—have led to instances of groupthink that can be off-putting at best and bigoted at worst. Yet there’s undeniably an appeal. Isn’t it cool to talk and literally be heard? After all, that’s the foundational promise of social media: democratization of voice.

Speak and you shall be heard

The intimacy of voice makes audio social media that much more appealing in the age of social distancing and isolation. Jimi Tele, the CEO of Chekmate, a “text-free” dating app that connects users through voice and video, says he wanted to launch an app that would be “catfish-proof,” referring to the practice of deceiving others online with fake profiles.

“We wanted to break away from the anonymity and gamification that texting allows and instead create a community rooted in authenticity, where users are encouraged to be themselves without judgment,” Tele says. The app’s users start voice memos that average five seconds and then get progressively longer. And while Chekmate has a video option, Tele says the app’s several thousand users overwhelmingly favor using just their voices. “They are perceived as less intimidating [than video messages],” he says.

This immediacy and authenticity is the reason Gilles Poupardin created Cappuccino. He wondered why there wasn’t already a product that gathered voice memos together into a single downloadable file. “Everyone has a group chat with friends,” he says. “But what if you could hear your friends? That’s really powerful.”

Mohan agrees. She says that her group of friends switched to Cappuccino from a Facebook messenger chat group and then tried Zoom calls early in the pandemic. But the discussions would inevitably turn into a highlight reel of big events. “There was no time for details,” she laments. The daily Cappuccino “beans,” as the stitched-together recordings are called, let Mohan’s friend circle keep up to date in a very intimate way. “My one friend is moving to a new apartment in a new city, and she was just talking about how she goes to get coffee in her kitchen,” she says. “That’s something I would never know in a Zoom call, because it’s so small.”

Even legacy social media firms are getting in on the act. In the summer of 2020 Twitter launched voice tweets—140 seconds of audio—in a feature called Spaces.

“We were interested in whether audio could add an additional layer of connection to the public conversation,” says Rémy Bourgoin, a senior software engineer on Twitter’s voice tweets and Spaces team.


We're not around right now. But you can send us an email and we'll get back to you, asap.


Log in with your credentials

Forgot your details?