New benchmarking tool evaluates the factuality of LLMs

A team of AI researchers and computer scientists from Cornell University, the University of Washington and the Allen Institute for Artificial Intelligence has developed a benchmarking tool called WILDHALLUCINATIONS to evaluate the factuality of multiple large language models (LLMs). The group has published a paper describing the factors that went into creating their tool on the arXiv preprint server.

OpenAI unveils sCM, a new model that generates video media 50 times faster than current diffusion models

techxplore.com - 24/Oct 14:25

Two experts with the OpenAI team have developed a new kind of continuous-time consistency model (sCM) that they claim can generate video media 50...

Scientists Achieve 1,400-Second Quantum Coherence in Schrödinger-Cat State

thequantumdaily.com - 31/Oct 08:57

Insider Brief A recent study published on the preprint site ArXiv has demonstrated a Schrödinger-cat state — a type of nonclassical quantum state...

Simplified octopus-inspired swimming robot with soft asymmetric arms can replicate swimming patterns

techxplore.com - 29/Oct 14:08

Researchers at the National University of Singapore have developed a new robot inspired by one of the most intelligent aquatic animals on Earth: the...

AI model that checks for skin cancer shows promise

oncologynews.com.au - 30/Oct 14:13

Scientists in the East of England have developed a way of using artificial intelligence to check for skin cancer, with the AI tool outperforming...

Revolutionary Sponge Extracts Gold from E-Waste

greekreporter.com - 30/Oct 09:45

A team of scientists has developed a type of sponge made of graphene oxide and chitosan, that can be used to extract gold from electronic waste....

Study shows AI can be fine-tuned for political bias

techxplore.com - 22/Oct 16:26

In an era where artificial intelligence is playing a growing role in shaping political narratives and public discourse, researchers have developed a...

Quantum Scientists Say Better Portfolio Management Might be in The (Decomposition) Pipeline

thequantumdaily.com - 10:07

Insider Brief Researchers report that a new method may help quantum computers one day tackle the complexity of large-scale portfolio optimization,...

‘Let the big boys in the Valley do it’: Nandan Nilekani of Infosys on AI LLMs

hindustantimes.com - 25/Oct 05:24

Infosys co-founder and non-executive chairman Nandan Nilekani said India should focus on building AI use cases than on building large language models...

New satellite tool picks out plastic on sand from more than 600 kilometres above

aumanufacturing.com.au - 30/Oct 22:42

A satellite imagery tool developed by RMIT University scientists and able to spot plastic rubbish beaches has been successfully field tested on a...

Is it AI? Peer reviewers struggle to distinguish LLMs from human writing

techxplore.com - 31/Oct 20:36

Large language models (LLMs) such as ChatGPT have grown so advanced that they can even pass the US Medical Licensing Exam. But how good are peer...

Rubriques :