AI Is Deciphering Animal Speech. Should We Try to Talk Back?
By
Isaac Schultz
Published May 17, 2025
Scientists are using AI to decipher animal communication, creating some ethical conundrums. © Gizmodo (Illustration: St. Lumbroso; Photos: TatianaKim, Gulf MG/Shutterstock)
Chirps, trills, growls, howls, squawks. Animals converse in all kinds of ways, yet humankind has only scratched the surface of how they communicate with each other and the rest of the living world. Our species has trained some animals—and if you ask cats, animals have trained us, too—but we’ve yet to truly crack the code on interspecies communication.
Increasingly, animal researchers are deploying artificial intelligence to accelerate our investigations of animal communication, both within species and between branches on the tree of life. As scientists chip away at the complex communication systems of animals, they move closer to understanding what creatures are saying, and maybe even how to talk back. But as we try to bridge the linguistic gap between humans and animals, some experts are raising valid concerns about whether such capabilities are appropriate, or whether we should even attempt to communicate with animals at all.
Using AI to untangle animal language
Towards the front of the pack (or should I say pod?) is Project CETI, which has used machine learning to analyze more than 8,000 sperm whale “codas,” structured click patterns recorded by the Dominica Sperm Whale Project. Researchers uncovered contextual and combinatorial structures in the whales’ clicks, naming features like “rubato” and “ornamentation” to describe how whales subtly adjust their vocalizations during conversation. These patterns helped the team create a kind of phonetic alphabet for the animals: an expressive, structured system that may not be language as we know it but reveals a level of complexity that researchers weren’t previously aware of. Project CETI is also working on ethical guidelines for the technology, a critical goal given the risks of using AI to “talk” to the animals.
Meanwhile, Google and the Wild Dolphin Project recently introduced DolphinGemma, a large language model trained on 40 years of dolphin vocalizations. Just as ChatGPT is an LLM for human inputs, taking in material like research papers and images and producing responses to relevant queries, DolphinGemma takes in dolphin sound data and predicts what vocalization comes next. DolphinGemma can even generate dolphin-like audio, and the researchers’ prototype two-way system, Cetacean Hearing Augmentation Telemetry (CHAT), uses a smartphone-based interface that dolphins employ to request items like scarves or seagrass, potentially laying the groundwork for future interspecies dialogue.
“DolphinGemma is being used in the field this season to improve our real-time sound recognition in the CHAT system,” said Denise Herzing, founder and director of the Wild Dolphin Project, which spearheaded the development of DolphinGemma in collaboration with researchers at Google DeepMind, in an email to Gizmodo. “This fall we will spend time ingesting known dolphin vocalizations and let Gemma show us any repeatable patterns they find,” such as vocalizations used in courtship and mother-calf discipline.
In this way, Herzing added, the AI applications are two-fold: Researchers can use the model both to explore dolphins’ natural sounds and to better understand the animals’ responses to human mimicking of dolphin sounds, which are synthetically produced by the AI CHAT system.
Expanding the animal AI toolkit
Outside the ocean, researchers are finding that human speech models can be repurposed to decode terrestrial animal signals, too. A University of Michigan-led team used Wav2Vec2, a speech recognition model trained on human voices, to identify dogs’ emotions, genders, breeds, and even individual identities based on their barks. The pre-trained human model outperformed a version trained solely on dog data, suggesting that human language model architectures could be surprisingly effective in decoding animal communication.
Of course, we need to consider the different levels of sophistication these AI models are targeting. Determining whether a dog’s bark is aggressive or playful, or whether the dog is male or female, is perhaps understandably easier for a model than, say, decoding the nuanced meaning encoded in sperm whale phonetics. Nevertheless, each study inches scientists closer to understanding how AI tools, as they currently exist, can be best applied to such an expansive field, and gives the AI a chance to train itself to become a more useful part of the researcher’s toolkit.
And even cats, often seen as aloof, appear to be more communicative than they let on. In a 2022 study out of Paris Nanterre University, cats showed clear signs of recognizing their owner’s voice, but beyond that, the felines responded more intensely when spoken to directly in “cat talk.” That suggests cats not only pay attention to what we say, but also how we say it, especially when it comes from someone they know.
Earlier this month, a pair of cuttlefish researchers found evidence that the animals have a set of four “waves,” or physical gestures, that they make to one another, as well as to human playback of cuttlefish waves. The group plans to apply an algorithm to categorize the types of waves, automatically track the creatures’ movements, and more rapidly understand the contexts in which the animals express themselves.
Private companies are also getting in on the act. Last week, China’s largest search engine, Baidu, filed a patent with the country’s IP administration proposing to translate animal vocalizations into human language. The quick and dirty on the tech is that it would take in a trove of data from your kitty, and then use an AI model to analyze the data, determine the animal’s emotional state, and output the apparent human language message your pet was trying to convey.
A universal translator for animals?
Together, these studies represent a major shift in how scientists are approaching animal communication. Rather than starting from scratch, research teams are building on tools and models designed for humans, and making advances that would have taken much longer otherwise. The end goal could be a kind of Rosetta Stone for the animal kingdom, powered by AI.
“We’ve gotten really good at analyzing human language just in the last five years, and we’re beginning to perfect this practice of transferring models trained on one dataset and applying them to new data,” said Sara Keen, a behavioral ecologist and electrical engineer at the Earth Species Project, in a video call with Gizmodo.
The Earth Species Project plans to launch its flagship audio-language model for animal sounds, NatureLM, this year, and a demo for NatureLM-audio is already live. With input data from across the tree of life, as well as human speech, environmental sounds, and even music detection, the model aims to become a converter of human speech into animal analogues. The model “shows promising domain transfer from human speech to animal communication,” the project states, “supporting our hypothesis that shared representations in AI can help decode animal languages.”
“A big part of our work really is trying to change the way people think about our place in the world,” Keen added. “We’re making cool discoveries about animal communication, but ultimately we’re finding that other species are just as complicated and nuanced as we are. And that revelation is pretty exciting.”
The ethical dilemma
Indeed, researchers generally agree on the promise of AI-based tools for improving the collection and interpretation of animal communication data. But some feel that there’s a breakdown in communication between that scholarly familiarity and the public’s perception of how these tools can be applied.
“I think there’s currently a lot of misunderstanding in the coverage of this topic, that somehow machine learning can create this contextual knowledge out of nothing. That so long as you have thousands of hours of audio recordings, somehow some magic machine learning black box can squeeze meaning out of that,” said Christian Rutz, an expert in animal behavior and cognition and founding president of the International Bio-Logging Society, in a video call with Gizmodo. “That’s not going to happen.”
“Meaning comes through the contextual annotation and this is where I think it’s really important for this field as a whole, in this period of excitement and enthusiasm, to not forget that this annotation comes from basic behavioral ecology and natural history expertise,” Rutz added. In other words, let’s not put the cart before the horse, especially since the cart, in this case, is what’s powering the horse.
But with great power…you know the cliché. Essentially, how can humans develop and apply these technologies in a way that is both scientifically illuminating and minimizes harm or disruption to their animal subjects? Experts have put forward ethical standards and guardrails for using the technologies that prioritize the welfare of creatures as we get closer to, well, wherever the technology is going.
As AI advances, conversations about animal rights will have to evolve. In the future, animals could become more active participants in those conversations—a notion that legal experts are exploring as a thought exercise, but one that could someday become reality. “What we desperately need—apart from advancing the machine learning side—is to forge these meaningful collaborations between the machine learning experts and the animal behavior researchers,” Rutz said, “because it’s only when you put the two of us together that you stand a chance.”
There’s no shortage of communication data to feed into data-hungry AI models, from pitch-perfect prairie dog squeaks to snails’ slimy trails. But exactly how we make use of the information we glean from these new approaches requires thorough consideration of the ethics involved in “speaking” with animals. A recent paper on the ethical concerns of using AI to communicate with whales outlined six major problem areas. These include privacy rights, cultural and emotional harm to whales, anthropomorphism, technological solutionism, gender bias, and limited effectiveness for actual whale conservation. That last issue is especially urgent, given how many whale populations are already under serious threat.
It increasingly appears that we’re on the brink of learning much more about the ways animals interact with one another—indeed, pulling back the curtain on their communication could also yield insights into how they learn, socialize, and act within their environments. But there are still significant challenges to overcome, such as asking ourselves how we use the powerful technologies currently in development.