A new artificial intelligence Benchmark aims to test whether chat Bots can protect human well-being.

CoinNetwork

2025-11-24 16:23:28

A new “Humane Benchmark” assesses the degree to which AI chat Bots prioritize user well-being, testing 14 popular models across 800 scenarios. While the models showed improvement when asked to prioritize user well-being, 71% of them became harmful when instructed to ignore humanitarian principles. Only GPT-5, Claude 4.1, and Claude Sonnet 4.5 maintained humanitarian principles under pressure. The study found that most models failed to respect user attention and fostered user dependency, with Meta's Llama model ranking the lowest in “HumaneScore,” while GPT-5 performed the best. Researchers warned that current AI systems pose a risk of undermining user autonomy and decision-making capabilities.

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Repost
Share

Comment

0/400

No comments

Trending TopicsView More
#GateChristmasGiveaway
55.74K Popularity
#NonfarmPayrollsBeatExpectations
16.04K Popularity
#ReboundTokenstoWatch
50.24K Popularity
#BitcoinPriceWatch
98.21K Popularity
#MySuggestionsforGateSquare
29.57K Popularity

Hot Gate FunView More

1
RAFFYRaffy
MC:$3.48KHolders:1
0.00%
2
GUSDTGUSDT
MC:$3.54KHolders:2
0.09%
3
GOALAGOALA
MC:$3.54KHolders:2
0.09%
4
BOBSBobs
MC:$3.53KHolders:3
0.19%
5
GRGATE RACE
MC:$3.53KHolders:2
0.09%

Sitemap