
Slashdot: 'How Many AIs Does It Take To Read a PDF?'

Published on February 24, 2026 at 12:20AM
Despite AI's progress in building complex software, the ubiquitous PDF -- a format Adobe developed in the early 1990s to preserve the precise visual appearance of documents -- remains something of a grand challenge. PDFs consist of character codes, coordinates, and rendering instructions rather than logically ordered text, and even state-of-the-art models asked to extract information from them will summarize instead, confuse footnotes with body text, or outright hallucinate contents, The Verge writes. Companies like Reducto are now tackling the problem by segmenting pages into components -- headers, tables, charts -- and then routing each to specialized parsing models, an approach borrowed from computer vision techniques used in self-driving vehicles. Researchers at Hugging Face recently found roughly 1.3 billion PDFs sitting in Common Crawl alone, and the Allen Institute for AI has noted that PDFs could provide trillions of novel, high-quality training tokens from government reports, textbooks, and academic papers -- the kind of data AI developers are increasingly desperate for.

Read more of this story at Slashdot.
