The $118 pet whisperer: This Chinese smart collar claims to give you beloved pet a voice

The $118 pet whisperer: This Chinese smart collar claims to give you beloved pet a voice

Unlike the rudimentary novelty apps of the early 2000s that mapped sounds to static soundboards, PettiChat approaches animal communication through what founder Li Jingyuan calls an "Animal Behavior World Model."

Advertisement
International pre-orders eclipse 10,000 units ahead of its official May 30 release. International pre-orders eclipse 10,000 units ahead of its official May 30 release.
Business Today Desk
  • May 25, 2026,
  • Updated May 25, 2026 1:57 PM IST

A tiny, 27.2-gram plastic clip hanging from a cat’s collar might just be the most polarizing piece of technology in consumer artificial intelligence today. 

Developed by Hangzhou-based startup Meng Xiaoyi, the device, dubbed PettiChat, has sent social media into a frenzy by promising to solve humanity's oldest domestic mystery: what your pet is actually trying to say.

Advertisement

Backed by a freshly secured $1 million seed round and utilizing Alibaba Cloud’s massive Qwen large language model (LLM), the hardware startup claims its system can translate meows and barks into full human sentences within 1.2 seconds, boasting a contextual emotion-detection accuracy rate of up to 94.6% for cats and 92.3% for dogs. 

Yet, as international pre-orders eclipse 10,000 units ahead of its official May 30 release, the tech has run headfirst into a wall of scientific skepticism. Critics and veterinarians are already labeling the 799-yuan ($118) device an expensive "IQ tax" — a clever novelty engine masquerading as zoological breakthrough. 

From barks to tokens 

Unlike the rudimentary novelty apps of the early 2000s that mapped sounds to static soundboards, PettiChat approaches animal communication through what founder Li Jingyuan calls an "Animal Behavior World Model." 

Advertisement

The device views an animal's physical existence not as random noise, but as predictable data packets. 

"Humans already have large language models based on multi-modal data. We hope to build a world model for animals that includes visual, auditory, and behavioral signals." Li Jingyuan, Founder of PettiChat said.

The hardware relies on a complex multi-modal framework to process interspecies interactions: 

  • The Input: Built-in microphones, motion sensors, and accelerometers capture the pitch of an animal's vocalisation alongside real-time physical posture data (e.g., tail angles, ear positions, and pacing). 
  • The Brain: The collar processes these inputs against a proprietary database containing over 1.5 million real-world pet audio samples and 3,200 hours of annotated video reviewed by veterinarians. 
  • The Tokenisation: The AI treats movements and vocalizations as "behavior tokens," using a Transformer-based architecture to predict what the animal is experiencing before delivering a localized translation on a smartphone app. 

In promotional videos released by the company, the interface acts as a literal chatroom. A cat lets out a drawn-out meow, and the phone screen flashes: "I wanna play." A dog barks twice at the door, and the app reads: "I'm hungry."

Advertisement

More audaciously, the company claims the device features two-way communication, supposedly translating human commands into acoustic signals that pets can intuitively understand. 

The 95% question: Science or science fiction? 

Despite the viral marketing momentum, experts warn that the underlying science remains unproven. The primary point of friction is the company’s loudly advertised 95% accuracy metric, which has not been subjected to independent verification. 

Animal behaviorists point out that animal communication is highly dependent on situational context that a small collar sensor cannot fully capture. A dog barking near an empty food bowl implies a fundamentally different motivation than the same bark directed at a passing car. While PettiChat claims its spatial tracking and motion sensors bridge this gap, the company has yet to publish peer-reviewed studies or release its dataset for third-party validation. 

Furthermore, engineers question how the 27.2-gm device will filter out domestic chaos. In a typical apartment, microphones will invariably pick up traffic, television noise, music, vacuum cleaners, and human voices. While the model may operate with high precision inside a quiet, controlled laboratory environment, real-world living rooms introduce variable audio interference that can throw off sensitive voiceprint models. 

A tiny, 27.2-gram plastic clip hanging from a cat’s collar might just be the most polarizing piece of technology in consumer artificial intelligence today. 

Developed by Hangzhou-based startup Meng Xiaoyi, the device, dubbed PettiChat, has sent social media into a frenzy by promising to solve humanity's oldest domestic mystery: what your pet is actually trying to say.

Advertisement

Backed by a freshly secured $1 million seed round and utilizing Alibaba Cloud’s massive Qwen large language model (LLM), the hardware startup claims its system can translate meows and barks into full human sentences within 1.2 seconds, boasting a contextual emotion-detection accuracy rate of up to 94.6% for cats and 92.3% for dogs. 

Yet, as international pre-orders eclipse 10,000 units ahead of its official May 30 release, the tech has run headfirst into a wall of scientific skepticism. Critics and veterinarians are already labeling the 799-yuan ($118) device an expensive "IQ tax" — a clever novelty engine masquerading as zoological breakthrough. 

From barks to tokens 

Unlike the rudimentary novelty apps of the early 2000s that mapped sounds to static soundboards, PettiChat approaches animal communication through what founder Li Jingyuan calls an "Animal Behavior World Model." 

Advertisement

The device views an animal's physical existence not as random noise, but as predictable data packets. 

"Humans already have large language models based on multi-modal data. We hope to build a world model for animals that includes visual, auditory, and behavioral signals." Li Jingyuan, Founder of PettiChat said.

The hardware relies on a complex multi-modal framework to process interspecies interactions: 

  • The Input: Built-in microphones, motion sensors, and accelerometers capture the pitch of an animal's vocalisation alongside real-time physical posture data (e.g., tail angles, ear positions, and pacing). 
  • The Brain: The collar processes these inputs against a proprietary database containing over 1.5 million real-world pet audio samples and 3,200 hours of annotated video reviewed by veterinarians. 
  • The Tokenisation: The AI treats movements and vocalizations as "behavior tokens," using a Transformer-based architecture to predict what the animal is experiencing before delivering a localized translation on a smartphone app. 

In promotional videos released by the company, the interface acts as a literal chatroom. A cat lets out a drawn-out meow, and the phone screen flashes: "I wanna play." A dog barks twice at the door, and the app reads: "I'm hungry."

Advertisement

More audaciously, the company claims the device features two-way communication, supposedly translating human commands into acoustic signals that pets can intuitively understand. 

The 95% question: Science or science fiction? 

Despite the viral marketing momentum, experts warn that the underlying science remains unproven. The primary point of friction is the company’s loudly advertised 95% accuracy metric, which has not been subjected to independent verification. 

Animal behaviorists point out that animal communication is highly dependent on situational context that a small collar sensor cannot fully capture. A dog barking near an empty food bowl implies a fundamentally different motivation than the same bark directed at a passing car. While PettiChat claims its spatial tracking and motion sensors bridge this gap, the company has yet to publish peer-reviewed studies or release its dataset for third-party validation. 

Furthermore, engineers question how the 27.2-gm device will filter out domestic chaos. In a typical apartment, microphones will invariably pick up traffic, television noise, music, vacuum cleaners, and human voices. While the model may operate with high precision inside a quiet, controlled laboratory environment, real-world living rooms introduce variable audio interference that can throw off sensitive voiceprint models. 

Read more!
Advertisement