
NYC’s government chatbot is lying about city laws and regulations

Has a government employee checked all those zeroes and ones floating above the skyline?


If you follow generative AI news at all, you’re probably familiar with LLM chatbots’ tendency to “confabulate” incorrect information while presenting that information as authoritatively true. That tendency seems poised to cause some serious problems now that a chatbot run by the New York City government is making up incorrect answers to some important questions of local law and municipal policy.

NYC’s “MyCity” chatbot launched as a “pilot” program last October. The announcement touted the chatbot as a way for business owners to “save … time and money by instantly providing them with actionable and trusted information from more than 2,000 NYC Business webpages and articles on topics such as compliance with codes and regulations, available business incentives, and best practices to avoid violations and fines.”

But a new report from The Markup and local nonprofit news site The City found the MyCity chatbot giving dangerously wrong information about some pretty basic city policies. To cite just one example, the bot said that NYC buildings “are not required to accept Section 8 vouchers,” when an NYC government info page says clearly that Section 8 housing subsidies are one of many lawful sources of income that landlords are required to accept without discrimination. The Markup also received incorrect information in response to chatbot queries regarding worker pay and work hour regulations, as well as industry-specific information like funeral home pricing.

Welcome news for people who think the rent is too damn high, courtesy of the MyCity chatbot.

Further testing from Bluesky user Kathryn Tewson shows the MyCity chatbot giving some dangerously wrong answers regarding the treatment of workplace whistleblowers, as well as some hilariously bad answers regarding the need to pay rent.

This is going to keep happening

The result isn’t too surprising if you dig into the token-based predictive models that power these kinds of chatbots. MyCity’s Microsoft Azure-powered chatbot uses a complex process of statistical associations across millions of tokens to essentially guess at the most likely next word in any given sequence, without any real understanding of the underlying information being conveyed.
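To make that concrete, here's a toy sketch in Python of the core idea: a "model" that picks the next word purely from co-occurrence counts in its training text. The corpus and the simple bigram scheme here are invented for illustration; production LLMs use neural networks over vastly larger vocabularies and contexts, but the "most likely next token" principle is the same.

```python
from collections import Counter, defaultdict

# Invented toy corpus standing in for training data.
corpus = (
    "landlords must accept vouchers . "
    "landlords must not discriminate . "
    "landlords may raise rents ."
).split()

# Count which token follows which: a bigram "model" of the text.
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def next_token(prev: str) -> str:
    """Return the statistically most likely next token: a guess based
    on co-occurrence counts, with no understanding of the content."""
    return follows[prev].most_common(1)[0][0]

print(next_token("landlords"))  # -> "must", the most frequent follower
```

The model emits whatever the statistics favor, whether or not it happens to be true.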

That can cause problems when a single factual answer to a question might not be reflected precisely in the training data. In fact, The Markup said that at least one of its tests resulted in the correct answer on the same query about accepting Section 8 housing vouchers (even as “ten separate Markup staffers” got the incorrect answer when repeating the same question).
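That inconsistency fits the same picture. Chatbots typically sample from the next-token probability distribution rather than always taking the single most likely word, so the same question can draw different answers on different runs. A minimal sketch, with weights invented purely for illustration:

```python
import random

# Hypothetical next-token weights after a prompt like
# "Must landlords accept Section 8 vouchers? Answer:".
# The numbers are made up for this example.
weights = {"yes": 6, "no": 3, "sometimes": 1}

def sample_answer() -> str:
    """Draw the next token in proportion to its weight, as chatbots
    commonly do at a nonzero sampling temperature."""
    return random.choices(list(weights), weights=list(weights.values()), k=1)[0]

# The same "query," repeated, yields different answers across runs.
print([sample_answer() for _ in range(10)])
```

Turning the sampling temperature down to zero makes answers more repeatable, but not any more grounded in fact.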

The MyCity Chatbot—which is prominently labeled as a “Beta” product—tells users who bother to read the warnings that it “may occasionally produce incorrect, harmful or biased content” and that users should “not rely on its responses as a substitute for professional advice.” But the page also states front and center that it is “trained to provide you official NYC Business information” and is being sold as a way “to help business owners navigate government.”

Andrew Rigie, executive director of the NYC Hospitality Alliance, told The Markup that he had encountered inaccuracies from the bot himself and had received reports of the same from at least one local business owner. But NYC Office of Technology and Innovation Spokesperson Leslie Brown told The Markup that the bot “has already provided thousands of people with timely, accurate answers” and that “we will continue to focus on upgrading this tool so that we can better support small businesses across the city.”

NYC Mayor Eric Adams touts the MyCity chatbot in an October announcement event.

The Markup’s report highlights the danger of governments and corporations rolling out chatbots to the public before their accuracy and reliability have been fully vetted. Last month, a court forced Air Canada to honor a refund policy invented by a chatbot on its website. A recent Washington Post report found that chatbots integrated into major tax preparation software provide “random, misleading, or inaccurate … answers” to many tax queries. And some crafty prompt engineers have reportedly been able to trick car dealership chatbots into accepting a “legally binding offer – no take backsies” for a $1 car.

These kinds of issues are already leading some companies away from generalized LLM-powered chatbots and toward Retrieval-Augmented Generation (RAG) systems, which retrieve passages from a curated set of relevant documents at query time and use that text to ground the model’s response, rather than relying on its training data alone. That kind of focus could become that much more important if the FTC is successful in its efforts to hold companies liable for “false, misleading, or disparaging” information from their chatbots.
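In rough outline, a RAG pipeline looks something like the sketch below. The document store, the word-overlap scoring, and the function names are all simplifications invented for illustration; real systems typically retrieve by embedding similarity and then feed the retrieved passage to an LLM as context.

```python
# Hypothetical document store; contents are paraphrased for illustration.
DOCUMENTS = [
    "Landlords must accept Section 8 vouchers as a lawful source of income.",
    "Funeral homes must give customers an itemized price list on request.",
    "Employers must pay overtime for hours worked beyond 40 per week.",
]

def retrieve(query: str, docs: list[str]) -> str:
    """Return the document sharing the most words with the query.
    Production systems use embedding similarity, not raw word overlap."""
    q_words = set(query.lower().split())
    return max(docs, key=lambda d: len(q_words & set(d.lower().split())))

def answer(query: str) -> str:
    # The retrieved passage is injected into the model's prompt, so the
    # generated answer is grounded in vetted text rather than in whatever
    # the model's training statistics happen to favor.
    context = retrieve(query, DOCUMENTS)
    return f"According to official guidance: {context}"

print(answer("Do landlords have to accept Section 8 vouchers?"))
```

The retrieval step narrows what the model can say, which is the point: a vetted document store gives the generator far less room to confabulate.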



