uxdesign.cc
From human-like interactions to voice customization and accessibility, learn how to create smarter, more user-centered chatbots.The rise of AI has transformed how we think about product design and development. Platforms like GPT and Gemini have made it possible to create chatbots with unprecedented sophistication, bringing cutting-edge technology closer to everyday applications. But this isnt just about tools or capabilitiesits about a shift in how we approach designitself.For designers, the introduction of AI marks the beginning of a new chapter that requires us to rethink traditional processes and embrace entirely new methods. Building AI-powered products is far from a plug-and-play process; it demands careful attention to user experience, deeper insights into user behavior, and a commitment to crafting solutions beyond functionality. With AI, we have an extraordinary opportunity to connect with users more personally, creating tailored experiences that address their unique needs, preferences, and limitations.Over the past year, Ive been deeply immersed in designing an AI-driven chatbot, gathering valuable insights and experience along the way. In this article, Ill share some thoughts on how to make chatbot experiences feel more real, natural, and user-friendlyqualities that people genuinely seek in conversational AI.Designing the look of yourchatbotThere are a few schools of thought when it comes to visualizing chatbots. Faceless chatbots, like those of GPT, Gemini, or Google Assistant, are often represented by simple illustrations or iconsespecially in text mode, where their small avatar size requires a clear, recognizable design. In voice mode, these chatbots sometimes adopt abstract compositions, such as the visual styles seen with GPT, Gemini, or the recently refreshed Siri. This approach is common for AI models designed to be integrated into a variety of specific products. (For the record, Im a fan of Siris newlook!)ChatGPT & Gemini in voice chatmodeAs we delve deeper into building more specialized products, the avatar strategy tends to shift. In these cases, its not uncommon to see chatbots represented by character avatars. While some might find this approach too literal, it can be highly effective, particularly in contexts like customer service. However, this strategy comes with a potential pitfall: if the avatar appears very human-like but doesnt fully reach the level of realism needed to feel truly human, it risks crossing into the uncanny valley. This is that strange moment when the avatar feels almost human but not quite enough, creating an awkward or unsettling experience for users. Ill delve further into this issue in futureposts.Praktika.ai: Automated 11 tutorship powered by gen-AIavatarsChoosing the rightdesignIf youre unsure which approach to take, consider allowing users to customize the look of the chatbot in the settings. Provide a few different options, including abstract and literal representations, and let users choose their preferences. This approach not only personalizes the experience but also provides valuable insightsby analyzing the resulting data, you can identify trends and make more informed design decisions.Tailoring the voice: tone andstyleWith the advancement of products like ElevenLabs, we now have powerful tools to fine-tune the tone and style of a chatbots voice responses. Designers can decide whether they want the chatbot to respond in a neutral, generic tone, adopt a softer, whispering style, or even adapt its tone and intonation dynamically based on specific contexts.ElevenLabs.io: AI agent; testmodeWhy is this level of customization so crucial? For two reasons. First, in real life, the way we speak is rarely linear. Humans are emotional beings, and context almost always shapes our communication. For example, the tone we use when apologizing is very different from the tone we use when celebrating. To make the experience feel more authentic (and potentially increase user engagementthough theres a caveat, which Ill elaborate on at the end of this entry), its vital to align the speaking style of the chatbot with the weight of the words and the context of the conversation.Good communication is about more than just the words themselves. According to the 55/38/7 formula, only 7% of communication is conveyed through words. A significant 38% comes from vocal tone, and 55% is from nonverbal cues. This makes it essential for chatbots to respond in a manner that feels human and emotional. This doesnt just mean matching the tone to the context; it also requires the chatbot to interpret the users input on a deeper, more emotional level to ensure a truly natural interaction.The role ofaccentsAnother important aspect of a chatbots speaking style is its accent. For users outside English-speaking countries, theres often a perception of a standard British accent, sometimesthough less and less oftenassociated with Received Pronunciation (RP). However, within the UK, there are nearly 40 distinct regional accents, each with its unique character and identity, showcasing the true diversity of Englishspeech.https://medium.com/media/5f134159c2d2a6e96e2f34c7b6682a0b/hrefOne of the most surprising and entertaining updates to ChatGPTs voice mode has been its ability to adopt accents. But it doesnt stop at simply choosing an accent for your assistant, which is already a common feature. You can now ask the assistant to speak in a mixed accent, such as that of a Polish person who has lived in Ireland for years. GPT handles this surprisingly well, combining strong Eastern European pronunciation with the unique rhythm and intonation typical of Irish English, resulting in an authentic and highly entertaining interaction.ChatGPT: voice chat; choose a voicesectionNow, imagine youre designing a customer service chatbot for different regions of the UK. Instead of offering a one-size-fits-all voice, your chatbot could adopt the local accent of each region, creating a more relatable and tailored experience for users. For example, a chatbot in Newcastle could use a Geordie accent, while one in Birmingham might adopt the Brummie style. This level of customization would not only enhance user engagement but also add cultural familiarity, making the interaction feel more personal andgenuine.https://medium.com/media/75d038594ab5223a9db188ab6fc1a740/hrefCurrently, none of the available models offer a wide range of regional accents (which is unfortunate), but GPT does include a limited selection of English accents. With ongoing experiments in this area, the future of regional accent customization looks promising.Text reveal: balancing message length and user experienceWhen it comes to chatbot message length, platforms like GPT and Gemini generally aim to balance conciseness and depth. By default, these models prioritize concise responses while ensuring they fully address the users query. For instance, simple questions typically result in answers averaging around 2050words.However, not all chatbots need to follow this formula. For example, a Storytelling Chatbot might require longer and more engaging narratives to entertain users, where the goal extends beyond providing information.ChatGPT: text chatmodeWhy does thismatter?Aligning the message style with the products purpose and the conversations context is essential. At the same time, overly lengthy paragraphs can feel overwhelming, especially if the UI isnt designed to handle them effectively. Thoughtful text-reveal strategies and interactions play a vital role in ensuring a smooth user experience that aligns both UI andUX.Looking at popular AI models like GPT, Claude, Gemini, and Grok, we can observe notable differences in how information is revealed tousers:GPT and Claude present the text in a typewriter-like fashion, where the words appear as if theyre being typed out in real time. While this adds an element of dynamism, it can feel stressful for users who are more sensitive to visual stimulation or time pressure.Claude: text chatmodeGemini takes a different approach by displaying a shimmering preloader while the response is being generated, which can feel more anticipatory and lessjarring.Gemini: text chatmodeGrok and Pi.ai (built on Claude) stand out with a more subtle and polished reveal. Their text appears smoothly and pleasingly, making the experience particularly comfortable, especially when the generated content islengthy.Pi.ai: text chatmodeManaging cognitive loadAnother critical aspect of chatbot design is managing cognitive load by reducing visual clutter and maintaining focus. Platforms like Pi.ai, for instance, shift older responses out of view as new ones are generated. This approach keeps the interface clean and allows users to focus on the most relevant and recent information without being overwhelmed by chat historyclutter.Adjusting the pace of the responsesOne of the lesser-explored patterns in voice chatbots is providing settings to adjust the pace of the responses. While similar tools are commonly used by screen reader users, they remain a novelty in the context of voice chatbots.Now, imagine two simple sliders: one controlling the overall response rate (how fast the chatbot speaks), and another adjusting the pauses between sentences or paragraphs.This solution is both simple and incredibly powerful, yet its an area that hasnt been fully explored in AI chatbots. (Let me know in the comments if youve come across a chatbot that offers something similar!)VoiceOver settings: speaking ratesliderThis kind of customization could be particularly helpfulfor:Users with hearing difficulties need slower and clearer responses.Non-native speakers, often benefit from slower speech and longer pauses for comprehension.Users with cognitive challenges, for whom more deliberate pacing aids understanding.High-stress situations, where slower and calmer responses help reduce anxiety (e.g., mental health or crisis support chatbots).Integrating this feature would not only improve accessibility but also create a more personalized and user-friendly experience. Its a small addition with the potential for a bigimpact.Other conversation dynamics vs. UIpatternsWhen it comes to human-chatbot interaction, there are currently three primary UI patterns:Voice-to-Voice Mode: This is the most natural and hands-free option, where users dont need to interact with the device to communicate physically.Hold-to-Talk Mode: The user presses and holds a microphone button to speak to thechatbot.Record Mode: A familiar pattern found in most messaging apps, where the user records a message and sends it to the chatbot (or to a person) for processing.1: Voice-to-voice; 2: Hold-to-Talk; 3:RecordFrom a communication standpoint, hands-free voice-to-voice interaction feels the most natural. However, it presents significant UX challenges, even with advanced models like ChatGPT. One notable issue is that chatbots still struggle to accurately detect when a user has finished speaking.Enhancing voice interactionsIn the latest version of GPTs voice chatbot, there are still occasional scenarios where the assistant might step in prematurely if a user pauses mid-sentence to gather their thoughts. While this can interrupt the flow of the conversation, GPT offers some features that significantly improve the experience:Interruptibility: Users can interrupt the assistant mid-response. It immediately stops speaking and resumes listening, allowing the user to continue seamlessly.Adjustable Listening Time: Users can request the assistant to allow more time for their responses. This feature helps ensure that pauses for thinking dont lead to interruptions, resulting in a smoother conversational flow.These features make the latest GPT version one of the most advanced voice chat assistants available, demonstrating noticeable progress in addressing common challenges in voice-to-voice interactions.Reliable voice inputmethodsIf youre designing a chatbot interface, especially for voice interactions, its important to acknowledge these challenges. At the current stage of technology, the most reliable input methodsremain:Hold-to-Talk Buttons: A simple and familiar method that minimizes errors in detecting when the user is finished speaking.Record Mode: A practical and widely accepted solution for asynchronous voiceinput.While the hands-free voice-to-voice experience is improving rapidly, its not yet flawless. For now, designing with more controlled interaction patterns like hold-to-talk or record mode will provide a safer, more consistent user experience. Eventually, as technology advances, voice-to-voice interaction will likely become seamlessbut were not there quiteyet.SummaryAll the points mentioned above should not be taken as definitive advice for the design process. Since we are still in the early stages of the robotics eraand chatbots are, in essence, a form of roboticswe cannot fully predict how users will adapt to them. Some chatbots may excel with a more natural, human-like tone, while others might perform better with a rigid, robotic approach.As we navigate this new chapter in UX/UI design, its clear there is no universal formula or one-size-fits-all solution. The key to creating a high-performing chatbot lies in following an iterative process: designing, testing, learning, and repeating. Only through this cycle can we refine and adapt to meet the evolving needs and preferences ofusers.References I recommend to gothrough:AI: First New UI Paradigm in 60 Years by JakobNielsenWhat is chatbot design? byIBM.comThe Art of Building Customer-Facing AI Chatbots by Phaneendra KumarNamalaThe best links to get started with Conversational UI and chatbots by CaioBragaThe Power of Voice: How Sound Shapes Our Emotions and Interactions by MillianSpeaks | The Psychology ofSoundDesigning for AI: beyond the chatbot by RidhimaGuptaThe chatbot that mimics your accentand uses street slang by Mark Sellman for The SundayTimesCognitive Load and UI Design: Simplifying Interfaces for Enhanced User Experience by Jakub WojciechowskiDigital Accessibility: Understanding Screen Reader Interaction by Customer Experience PrudentialIterative Design: How to Optimize the Product Design Process by VladimirPavlovWeb Accessibility Tips: Give People Enough Time by Bureau of Internet AccessibilityBeyond the bot: redefining chatbot design in the age of AI was originally published in UX Collective on Medium, where people are continuing the conversation by highlighting and responding to this story.