“Safe” Is Not One Feeling

We often ask whether people think AI chatbots are safe. A better question is: safe how, and safe for whom?

We tend to talk about AI safety in blunt language. A system is safe or unsafe. Users trust it or they do not. A survey gives us a percentage, a headline turns that into a mood, and suddenly “the public” is treated like one person with one opinion.

That has always felt too simple to me. Safety is not one feeling when you are actually using a chatbot. You might worry about bad medical advice, but not about offensive language. You might find the tone polite while still wondering whether the answer is fair. You might use the tool every day and still avoid giving it anything personal.

A SOUPS 2025 paper, “Safety Perceptions of Generative AI Conversational Agents”, puts numbers around that messiness. Jan Tolsdorf, Alan F. Luo, Monica Kodwani, Junho Eum, Mahmood Sharif, Michelle L. Mazurek, and Adam J. Aviv surveyed 123 U.S. participants about how they perceive the safety of AI chatbots. Several of the authors are at George Washington University, and the work was supported through TRAILS, the NSF Institute for Trustworthy AI in Law & Society. It sits close to questions I keep coming back to in my own research: how people decide what AI systems are doing, when they feel comfortable using them, and when that comfort is earned.

Safety breaks into smaller questions

The paper starts with a useful move: it does not assume that risk, trust, and fairness are each one clean thing. The authors used factor analysis to see how people grouped their answers. The result was less tidy and more believable.

Risk split into three worries: discrimination, misinformation or unhelpfulness, and offensive language. Trust split into benevolence, reliability, and low personal risk. Fairness split into decision-making integrity and politeness. That last split matters. A chatbot can sound respectful and still leave someone unsure whether its judgments are fair.

Data figure showing median ratings for risk, trust, and fairness dimensions in a SOUPS 2025 study of AI chatbot safety perceptions. — Median ratings from the study. The scales all run from 1 to 5, but the meaning changes by construct: risk is likelihood of harm, trust is agreement with trust statements, and fairness is agreement with fairness statements.

The strongest risk concern was not slurs or obviously offensive replies. It was misinformation: the quiet, plausible answer that may be wrong. Offensive language had the lowest median risk rating. Discrimination sat in between. In other words, people were not just asking, “Will this chatbot say something awful?” They were also asking, “Will it confidently steer me wrong?”

The fairness results are just as useful. Participants gave politeness a higher median rating than decision-making integrity. That sounds small, but it names a problem I see in a lot of AI product language. A system can be smooth, friendly, and careful in tone without being fair in substance. Politeness is not the same as justice. It is not even the same as trust.

A chatbot can feel polite and still leave people unsure whether it is fair.

This is why a single safety score does not tell us much. If a company says people “trust” its chatbot, I want to know what kind of trust. Trust that it wants to help? Trust that it is reliable? Trust that using it will not expose the user to harm? Those are different judgments, and users seem to know that even when our dashboards do not.

There is no average user

The second part of the paper looks at how these perceptions cluster across people. Using latent class analysis, the authors identify three groups: Hesitant Skeptics, Cautious Trusters, and Confident Adopters.

Data figure comparing the three user groups from the SOUPS 2025 study: Hesitant Skeptics, Cautious Trusters, and Confident Adopters. — The study found three groups, not one average user. Frequent chatbot use rises from the skeptical group to the confident group, but usage does not explain everything.

The Hesitant Skeptics had the highest risk concerns and the lowest trust and fairness ratings. The Cautious Trusters were in the middle: fairly positive about trust and fairness, but still alert to risk. The Confident Adopters had the lowest risk concerns and the highest trust and fairness ratings.

Usage frequency was the clearest separator. Only 41% of Hesitant Skeptics reported using AI chatbots more than once a week, compared with 62% of Cautious Trusters and 75% of Confident Adopters. Familiarity seems to make chatbots feel safer.

But that is not the whole story. The paper notes that 41% of Hesitant Skeptics still used chatbots more than once a week, while 25% of Confident Adopters used them less frequently. So the easy story, “people worry because they do not use the tools enough,” does not hold. Some people use AI often and remain wary. Some people use it rarely and feel fine about it. Comfort and frequency overlap, but they are not the same thing.

Who gets to feel safe?

The demographic findings are the part I would handle most carefully. This was an exploratory study with 123 U.S. participants, and the authors are clear about the limits of that sample. The results should not be turned into a universal claim about whole communities.

Still, the patterns are worth taking seriously. Participants who identified as non-heterosexual were more likely to be Hesitant Skeptics and less likely to be Confident Adopters. Participants with higher incomes and participants who identified as non-White were more likely to be Confident Adopters. Older participants were underrepresented among Confident Adopters. Gender, disability status, and education level were not significantly associated with group membership.

The point is not to flatten any group into a single attitude. It is to avoid the lazy interpretation that skeptical users simply need more exposure. Skepticism can be a reasonable response to experience. If your identity has often been misread, stereotyped, or mishandled by technical systems, caution is not ignorance. It is data.

Skepticism is not a failure to understand the technology. Sometimes it is a clear read of what the technology has failed to understand.

What I take from it

The practical lesson is measurement. If people separate misinformation from discrimination, politeness from fairness, and benevolence from personal risk, then our tools should preserve those differences too. A simple average may be easy to report, but it can hide the concern that matters most.

It also makes me wary of mistaking a good tone for safety. A chatbot can sound calm and respectful without answering the harder questions: whether it is right, whether it treats people fairly, whether it handles sensitive contexts well, and whether the user understands the risk of relying on it.

Designing for an imaginary average user has the same problem. A reassurance that works for a Confident Adopter may do very little for a Hesitant Skeptic. That does not mean the skeptic is being difficult. It may mean the system has not earned trust in the dimension that matters to them.

So the question I want to keep asking is not only whether users feel comfortable. It is what kind of comfort, built on what evidence, and for whom. People already seem to understand that AI safety has more than one meaning. Our measurements and designs should too.

Shiza Ali

Safety breaks into smaller questions

There is no average user

Who gets to feel safe?

What I take from it