Math

Huge Math Error Corrected In Black Plastic Study (arstechnica.com) 105

Ars Technica's Beth Mole reports: Editors of the environmental chemistry journal Chemosphere have posted an eye-catching correction to a study reporting that toxic flame retardants from electronics wind up in some household products made of black plastic, including kitchen utensils. The study sparked a flurry of media reports a few weeks ago that urgently implored people to ditch their kitchen spatulas and spoons. Wirecutter even offered a buying guide for what to replace them with. The correction, posted Sunday, will likely take some heat off the beleaguered utensils. The authors made a math error that put the estimated risk from kitchen utensils off by an order of magnitude.

Specifically, the authors estimated that if a kitchen utensil contained middling levels of a key toxic flame retardant (BDE-209), the utensil would transfer 34,700 nanograms of the contaminant a day based on regular use while cooking and serving hot food. The authors then compared that estimate to a reference level of BDE-209 considered safe by the Environmental Protection Agency. The EPA's safe level is 7,000 ng -- per kilogram of body weight -- per day, and the authors used 60 kg as the adult weight (about 132 pounds) for their estimate. So, the safe EPA limit would be 7,000 multiplied by 60, yielding 420,000 ng per day. That's 12 times more than the estimated exposure of 34,700 ng per day. However, the authors missed a zero and reported the EPA's safe limit as 42,000 ng per day for a 60 kg adult. The error made it seem like the estimated exposure was nearly at the safe limit, even though it was actually less than a tenth of the limit.
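
The corrected arithmetic is simple enough to verify in a few lines; a minimal sketch using only the figures quoted above:

```python
# Checking the corrected math from the Chemosphere erratum.
exposure = 34_700          # estimated utensil exposure, ng/day
epa_reference = 7_000      # EPA reference dose, ng per kg body weight per day
body_weight = 60           # adult body weight in kg used by the authors

safe_limit = epa_reference * body_weight
print(safe_limit)                 # 420000 ng/day, not the 42000 first reported
print(safe_limit / exposure)      # ~12.1: exposure is under a tenth of the limit
```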
"We regret this error and have updated it in our manuscript," the authors said in a correction.

"This calculation error does not affect the overall conclusion of the paper," the correction reads. The study maintains that flame retardants "significantly contaminate" the plastic products, which have "high exposure potential."
AI

Microsoft Announces Phi-4 AI Model Optimized for Accuracy and Complex Reasoning (computerworld.com) 31

An anonymous reader shared this report from Computerworld: Microsoft has announced Phi-4 — a new AI model with 14 billion parameters — designed for complex reasoning tasks, including mathematics. Phi-4 excels in areas such as STEM question-answering and advanced problem-solving, surpassing similar models in performance. Phi-4, part of the Phi small language models (SLMs), is currently available on Azure AI Foundry under the Microsoft Research License Agreement and will launch on Hugging Face [this] week, the company said in a blog post.

The company emphasized that Phi-4's design focuses on improving accuracy through enhanced training and data curation.... "Phi-4 outperforms comparable and even larger models on tasks like mathematical reasoning, thanks to a training process that combines synthetic datasets, curated organic data, and innovative post-training techniques," Microsoft said in its announcement. The model leverages a new training approach that integrates multi-agent prompting workflows and data-driven innovations to enhance its reasoning efficiency. The accompanying report highlights that Phi-4 balances size and performance, challenging the industry norm of prioritizing larger models... Phi-4 achieved a score of 80.4 on the MATH benchmark and has surpassed other systems in problem-solving and reasoning evaluations, according to the technical report accompanying the release. This makes it particularly appealing for domain-specific applications requiring precision, like scientific computation or advanced STEM problem-solving.
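
Once the Hugging Face release lands, trying the model should look like any other transformers checkpoint; a minimal sketch, assuming the repository id is microsoft/phi-4 (an assumption, not confirmed in the announcement):

```python
# Hypothetical usage sketch; the repo id "microsoft/phi-4" is an assumption.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/phi-4"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "If 3x + 5 = 20, what is x? Show your reasoning."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```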

Microsoft emphasized its commitment to ethical AI development, integrating advanced safety measures into Phi-4. The model benefits from Azure AI Content Safety features such as prompt shields, protected material detection, and real-time application monitoring. These features, Microsoft explained, help users address risks like adversarial prompts and data security threats during AI deployment. The company also reiterated that Azure AI Foundry, the platform hosting Phi-4, offers tools to measure and mitigate AI risks. Developers using the platform can evaluate and improve their models through built-in metrics and custom safety evaluations, Microsoft added... With Phi-4, Microsoft continues to evolve its AI offerings while promoting responsible use through robust safeguards. Industry watchers will observe how this approach shapes adoption in critical fields where reasoning and security are paramount.

AI

Are People Starting to Love Self-Driving Robotaxis? (marketplace.org) 106

"In a tiny handful of places..." Wired wrote last month, "you can find yourself flanked by taxis with no one in the drivers' seats." But they added that "Granted, practically everyone has been numbed by the hype cycle."

Wired's response? "[P]ile a few of us into an old-fashioned, human-piloted hired car, then follow a single Waymo robotaxi wherever it goes for a whole workday" to "study its movements, its relationship to life on the streets, its whole self-driving gestalt. We'll interview as many of its passengers as will speak to us, and observe it through the eyes of the kind of human driver it's designed to replace."

This week Wired senior editor John Gravois discussed the experience on the business-news radio show Marketplace (with host Kai Ryssdal): Ryssdal: What kinds of reactions did you get from people once you tracked them down? What did they say about their experience in this driverless car?

Gravois: It was pretty uniform and impressive how much people just love it. They just like the experience of the drive; I guess it's a little bit less herky-jerky than a human driver. But I think a lot of it just comes down to people being kind of relieved not to have to talk to somebody else, as sad as that is...

Ryssdal: Tell me about Gabe, your Uber driver, and his thoughts on this whole thing, because that was super interesting.

Gravois: So Gabe, this is a guy whose labor is directly at stake. You know, he's a guy whose labor is going to be replaced by a Waymo. He's had 30 years of experience as a professional driver, first as a taxi driver. He even organized a taxi driver strike in the days before Uber. His first impression, I think his prejudice with Waymo from having shared the road with them sort of sporadically, was that they were kind of dopey, rule-following, frustrating vehicles to share the road with. But over the course of the day, he started to recognize that the Waymo was driving a lot like a taxi driver. The Waymo was doing things that were aggressive, that are exactly the kinds of things that a taxi driver is trained to be aggressive with, and doing things that were cautious that are exactly the kinds of things that taxi drivers are trained to be cautious with.

Ryssdal: Can we talk unit economics here? According to the math from a study you guys cite, Waymo is not making a whole lot of money per vehicle, right? And eventually they're going to scale, and it's going to work out, but for the moment, even though they've gotten 11-billion-something dollars, they're not turning a whole lot of profit here.

Gravois: Yeah, that's a big question, and the math, even in that study, is based on a lot of guesswork. It's really hard to say what the unit economics are. What we can say is that the ridership rates are going up so fast that that study is already well out of date. When we were doing our chase, I think the monthly ridership for Waymo was 100,000 rides a month. By October, it was already 150,000 rides a month. So, the economics are just shifting under our feet a lot.

AI

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft (wired.com) 27

Harvard University announced Thursday it's releasing a high-quality dataset of nearly one million public-domain books that could be used by anyone to train large language models and other AI tools. From a report: The dataset was created by Harvard's newly formed Institutional Data Initiative with funding from both Microsoft and OpenAI. It contains books scanned as part of the Google Books project that are no longer protected by copyright.

Around five times the size of the notorious Books3 dataset that was used to train AI models like Meta's Llama, the Institutional Data Initiative's database spans genres, decades, and languages, with classics from Shakespeare, Charles Dickens, and Dante included alongside obscure Czech math textbooks and Welsh pocket dictionaries. Greg Leppert, executive director of the Institutional Data Initiative, says the project is an attempt to "level the playing field" by giving the general public, including small players in the AI industry and individual researchers, access to the sort of highly-refined and curated content repositories that normally only established tech giants have the resources to assemble. "It's gone through rigorous review," he says.

Leppert believes the new public domain database could be used in conjunction with other licensed materials to build artificial intelligence models. "I think about it a bit like the way that Linux has become a foundational operating system for so much of the world," he says, noting that companies would still need to use additional training data to differentiate their models from those of their competitors.

AI

OpenAI Releases 'Smarter, Faster' ChatGPT - Plus $200-a-Month Subscriptions for 'Even-Smarter Mode' (venturebeat.com) 64

Wednesday OpenAI CEO Sam Altman announced "12 Days of OpenAI," promising that "Each weekday, we will have a livestream with a launch or demo..." And sure enough, today he announced the launch of two things:

- "o1, the smartest model in the world. Smarter, faster, and more features (e.g. multimodality) than o1-preview. Live in ChatGPT now, coming to API soon."

- "ChatGPT Pro. $200/month. Unlimited usage and even-smarter mode for using o1. More benefits to come!"

Altman added this update later: For extra clarity: o1 is available in our plus tier, for $20/month. With the new pro tier ($200/month), it can think even harder for the hardest problems. Most users will be very happy with o1 in the plus tier!
VentureBeat points out that subscribers "also gain access to GPT-4o, known for its advanced natural language generation capabilities, and the Advanced Voice feature for speech-based interactions."

And even for non-subscribers, ChatGPT can now also analyze images, points out VentureBeat, "a hugely helpful feature upgrade as it enables users to upload photos and have the AI chatbot respond to them, giving them detailed plans on how to build a birdhouse entirely from a single candid photo of one, for one fun example." In another, potentially more serious and impressive example, it is now capable of helping design data centers from sketches... o1 represents a significant evolution in reasoning model capabilities, including better handling of complex tasks, image-based reasoning, and enhanced accuracy. Enterprise and Education users will gain access to the model next week... OpenAI's updates also include safety enhancements, with the o1-preview scoring 84 on a rigorous safety test, compared to 22 for its predecessor...

To encourage the use of AI in societal-benefit fields, OpenAI has announced the ChatGPT Pro Grant Program. The initiative will initially award 10 grants to leading medical researchers, providing free access to ChatGPT Pro tools.

In a video, Altman displays graphs showing o1 dramatically outperforming GPT-4o on math questions, on competition coding at Codeforces, and on PhD-level science questions.
Earth

The New Climate Math on Hurricanes 136

Climate change has intensified hurricane wind speeds by an average of 19 mph in 84% of North Atlantic hurricanes between 2019 and 2024, according to new research that links warming ocean temperatures to storm intensity for individual hurricanes.

This year, Hurricanes Helene and Milton slammed into Florida, breaking meteorological records and causing catastrophic damage. The study by Climate Central found that higher sea surface temperatures elevated most hurricanes by an entire category on the Saffir-Simpson scale, with three storms, including Hurricane Rafael, seeing wind speeds increase by 34 mph due to warming.

Researchers calculated storm intensity using models of pre-warming ocean temperatures. "It's really the evolution of our science on sea surface temperature attribution that has allowed this work to take place," said lead author Daniel Gilford, noting that hurricane damage climbs steeply, as a high power of wind speed. For example, a storm with double the wind speed can cause 256 times as much damage. The methodology enables scientists to determine climate change impacts on hurricanes in near-real time.
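
The quoted factor pins down the power: if damage D scales as the eighth power of wind speed v (an exponent inferred here from the quoted numbers, not stated explicitly in the study), doubling the wind speed gives

\[
\frac{D(2v)}{D(v)} = \left(\frac{2v}{v}\right)^{8} = 2^{8} = 256
\]
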
Math

Does Casio's New Calculator Watch Take You Back To 6th Grade Math Class? (techspot.com) 78

Slashdot reader jjslash brings word that Casio "has reintroduced its iconic calculator watch featuring a retro design with green text on a negative LCD and a classic keypad layout."

TechSpot reports that the watch was based on the Casio Mini personal calculator first released in the early 1970s — even offering a keypad using the original fonts (with numbers separated by grid lines): Even the mode button, colored red, is a nod to the calculator's power indicator. The watch's calculator function can add, subtract, multiply, and divide up to eight digits. As for watch functions, you get dual time, an alarm, stopwatch functionality, and more...

Casio's original personal calculator debuted in 1972 and cost $59.95. It featured a six-digit display and was a quarter the size of its competitors at a third of their price. The calculator was an instant hit for Casio, selling a million units in the first 10 months on the market and more than six million units over the span of the series.

Long-time Slashdot reader antdude says "I still wear one! Casio Data Bank 150 model...!"

Share your own vintage calculator memories in the comments...
AI

AI Systems Solve Just 2% of Advanced Maths Problems in New Benchmark Test 82

Leading AI systems are solving less than 2% of problems in a new advanced mathematics benchmark, revealing significant limitations in their reasoning capabilities, research group Epoch AI reported this week.

The benchmark, called FrontierMath, consists of hundreds of original research-level mathematics problems developed in collaboration with over 60 mathematicians, including Fields Medalists Terence Tao and Timothy Gowers. While top AI models like GPT-4 and Gemini 1.5 Pro achieve over 90% accuracy on traditional math tests, they struggle with FrontierMath's problems, which span computational number theory to algebraic geometry and require complex reasoning.

"These are extremely challenging. [...] The only way to solve them is by a combination of a semi-expert like a graduate student in a related field, maybe paired with some combination of a modern AI and lots of other algebra packages," Tao said. The problems are designed to be "guessproof," with large numerical answers or complex mathematical objects as solutions, making it nearly impossible to solve without proper mathematical reasoning.

Further reading: New secret math benchmark stumps AI models and PhDs alike.
Math

Australian Mathematicians Debunk 'Infinite Monkey Theorem' 124

Australian mathematicians have shown that the famous "infinite monkey theorem" cannot play out within the lifespan of our universe. The theorem suggests monkeys typing randomly would eventually produce Shakespeare's complete works. Scientists Stephen Woodcock and Jay Falletta calculated that even 200,000 chimpanzees typing one character per second until the universe's heat death would fail to reproduce Shakespeare's writings.

A single chimp has only a 5% chance of typing "bananas" in its lifetime, with more complex phrases facing astronomically lower odds. "This finding places the theorem among other probability puzzles and paradoxes... where using the idea of infinite resources gives results that don't match up with what we get when we consider the constraints of our universe," Associate Prof Woodcock was quoted as saying by BBC.
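
That 5% figure can be roughly reproduced with a back-of-the-envelope calculation; a minimal sketch, assuming a 30-key typewriter, one keystroke per second, and a roughly 30-year lifespan (illustrative parameters, not necessarily the paper's exact values):

```python
# Rough check of the single-chimp "bananas" probability (assumed parameters).
keys = 30                              # assumed typewriter keys
target_len = len("bananas")            # 7 characters
p_window = (1 / keys) ** target_len    # chance one 7-keystroke window matches
lifespan_s = 30 * 365.25 * 24 * 3600   # ~30 years, one keystroke per second

# Windows overlap, so treating them as independent is an approximation.
p_lifetime = 1 - (1 - p_window) ** (lifespan_s - target_len + 1)
print(f"{p_lifetime:.1%}")             # about 4%, close to the quoted 5%
```
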
Math

Former Nvidia Engineer Discovers 41-Million-Digit Prime (tomshardware.com) 29

Former Nvidia engineer Luke Durant, working with the Great Internet Mersenne Prime Search (GIMPS), recently discovered the largest known prime number: (2^136,279,841)-1 or M136279841 (where the number following the letter M represents the exponent). The achievement was detailed on Mersenne.org. Tom's Hardware reports: This is the largest prime number we've seen so far, with the last one, M82589933, being discovered six years prior. What makes this discovery particularly fascinating is that this is the first GIMPS discovery that used the power of data center GPUs. Mihai Preda was the first one to harness GPU muscle in 2017, says the GIMPS website, when he "wrote the GpuOwl program to test Mersenne numbers for primality, making his software available to all GIMPS users." When Luke joined GIMPS in 2023, they built the infrastructure needed to deploy Preda's software across several GPU servers available in the cloud.

While it took a year of testing, Luke's efforts finally bore fruit when an A100 GPU in Dublin, Ireland gave the M136279841 result last October 11. This was then corroborated by an Nvidia H100 located in San Antonio, Texas, which confirmed its primality with the Lucas-Lehmer test.
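
The Lucas-Lehmer test mentioned above is remarkably compact; a minimal sketch (practical only for small exponents; GIMPS uses heavily optimized FFT-based arithmetic for exponents like 136,279,841):

```python
# Lucas-Lehmer test: for an odd prime p, M_p = 2**p - 1 is prime
# iff s == 0 after p - 2 iterations of s -> s**2 - 2 (mod M_p), s0 = 4.
def lucas_lehmer(p: int) -> bool:
    m = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0

print([p for p in (3, 5, 7, 11, 13, 17, 19) if lucas_lehmer(p)])
# [3, 5, 7, 13, 17, 19] -- M_11 = 2047 = 23 * 89 is composite
```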

AI

Anthropic's AI Can Now Run And Write Code (techcrunch.com) 23

Anthropic's Claude chatbot can now write and run JavaScript code. TechCrunch: Today, Anthropic launched a new analysis tool that helps Claude respond with what the company describes as "mathematically precise and reproducible answers." With the tool enabled -- it's currently in preview -- Claude can perform calculations and analyze data from files like spreadsheets and PDFs, rendering the results as interactive visualizations.

"Think of the analysis tool as a built-in code sandbox, where Claude can do complex math, analyze data, and iterate on different ideas before sharing an answer," Anthropic wrote in a blog post. "Instead of relying on abstract analysis alone, it can systematically process your data -- cleaning, exploring, and analyzing it step-by-step until it reaches the correct result." Anthropic gives a few examples of where this might be useful. For instance, a product manager could upload sales data and ask Claude for country-specific performance analysis, while an engineer could give Claude monthly financial data and have it create a dashboard highlighting key trends.

Math

Physicist Reveals Why You Should Run in The Rain (sciencealert.com) 116

Theoretical physicist Jacques Treiner, from the University of Paris Cité, explains why you should run in the rain: ... Let p represent the number of drops per unit volume, and let a denote their vertical velocity. We'll denote Sh as the horizontal surface area of the individual (e.g., the head and shoulders) and Sv as the vertical surface area (e.g., the body). When you're standing still, the rain only falls on the horizontal surface, Sh. This is the amount of water you'll receive on these areas. Even if the rain falls vertically, from the perspective of a walker moving at speed v, it appears to fall obliquely, with the angle of the drops' trajectory depending on your speed. During a time period T, a raindrop travels a distance of aT. Therefore, all raindrops within a shorter distance will reach the surface: these are the drops inside a cylinder with a base of Sh and a height of aT, which gives:
p · Sh · a · T

As we have seen, as we move forward, the drops appear to move with an oblique velocity that combines the vertical velocity a and the walking velocity v. The number of drops reaching Sh remains unchanged, since velocity v is horizontal and therefore parallel to Sh. However, the number of drops reaching surface Sv -- which was previously zero when the walker was stationary -- has now increased. This is equal to the number of drops contained within a horizontal cylinder with a base area of Sv and a length of v · T. This length represents the horizontal distance the drops travel during this time interval. In total, the walker receives a number of drops given by the expression:
p · (Sh · a + Sv · v) · T

Now we need to take into account the time interval during which the walker is exposed to the rain. If you're covering a distance d at constant speed v, the time you spend walking is d/v. Plugging this into the equation, the total amount of water you encounter is:
p · (Sh · a + Sv · v) · d/v = p · (Sh · a/v + Sv) · d
This equation proves that the faster you move, the less water hits your head and shoulders, but the amount of water hitting the vertical part of your body remains constant. To stay drier, it's best to move quickly and lean forward. However, you'll have to increase your speed to offset the exposed surface area caused by leaning.
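
The trade-off is easy to see numerically; a minimal sketch of the model above, with invented values for the constants:

```python
# Water collected over a fixed distance d at speed v, per the model above:
#   W(v) = p * (Sh * a / v + Sv) * d   (all constants below are assumptions)
p = 5000.0            # drops per cubic metre
a = 6.0               # vertical raindrop speed, m/s
Sh, Sv = 0.06, 0.40   # horizontal and vertical surface areas, m^2
d = 100.0             # distance to cover, m

def drops(v: float) -> float:
    """Total drops hitting the walker over distance d at speed v."""
    return p * (Sh * a / v + Sv) * d

for v in (1.0, 2.0, 4.0, 8.0):
    print(f"v = {v} m/s: {drops(v):12,.0f} drops")
# The Sh*a/v term shrinks with speed; the p*Sv*d term is a floor you
# can only reduce by leaning forward (shrinking Sv).
```
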
Math

A Calculator's Most Important Button Has Been Removed (theatlantic.com) 108

Apple's latest iOS update has removed the "C" button from its Calculator app, replacing it with a backspace function. The change, part of iOS 18, has sparked debate among users accustomed to the traditional clear function. The removal of the "C" button represents a significant departure from decades-old calculator design conventions, The Atlantic writes. From the story: The "C" button's function is vestigial. Back when calculators were commercialized, starting in the mid-1960s, their electronics were designed to operate as efficiently as possible. If you opened up a desktop calculator in 1967, you might have found a dozen individual circuit boards to run and display its four basic mathematical functions. Among these would have been an input buffer or temporary register that could store an input value for calculation and display. The "C" button, which was sometimes labeled "CE" (Clear Entry) or "CI" (Clear Input), provided a direct interface to zero out -- or "clear" -- such a register. A second button, "AC" (All Clear), did the same thing, but for other parts of the circuit, including previously stored operations and pending calculations. (A traditional calculator's memory buttons -- "M+," "M-," "MC" -- would perform simple operations on a register.)

By 1971, Mostek and Texas Instruments had developed a "calculator on a chip," which condensed all of that into a single integrated circuit. Those chips retained the functions of their predecessors, including the ones that were engaged by "C" and "AC" buttons. And this design continued on into the era of pocket calculators, financial calculators, and even scientific calculators such as the ones you may have used in school. Some of the latter were, in essence, programmable pocket computers themselves, and they could have been configured with a backspace key. They were not.
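
A toy register model makes the three behaviors concrete; a minimal sketch (illustrative only, not Apple's or any vendor's implementation):

```python
# Toy model of the clear-vs-backspace semantics described above.
class Calculator:
    def __init__(self):
        self.entry = ""       # input buffer ("temporary register")
        self.pending = []     # stored operations awaiting evaluation

    def press(self, ch: str):
        self.entry += ch

    def clear_entry(self):    # "C" / "CE": zero out only the input register
        self.entry = ""

    def all_clear(self):      # "AC": also discard pending operations
        self.entry = ""
        self.pending.clear()

    def backspace(self):      # iOS 18-style: drop one character at a time
        self.entry = self.entry[:-1]
```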

Math

52nd Known Mersenne Prime Found (mersenne.org) 61

chalsall writes: After more than six years of work since the last discovery, the Great Internet Mersenne Prime Search (GIMPS) has found the 52nd known Mersenne Prime number. This is also the largest prime number known to humans.

The number is 2^136,279,841-1, which is 41,024,320 decimal digits long.
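
The digit count follows directly from the exponent, since a positive integer N has floor(log10 N) + 1 decimal digits:

```python
from math import floor, log10

p = 136_279_841
# 2**p - 1 has the same digit count as 2**p (2**p is never a power of 10).
print(floor(p * log10(2)) + 1)  # 41024320
```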

Luke Durant, a researcher from San Jose, CA, found it after contributing a fantastic amount of compute to the GIMPS project.

News

'A Nobel For the Big Big Questions' (noahpinion.blog) 15

In a rather critical analysis of the 2024 Economics Nobel, commentator Noah Smith has questioned the prize's shift back to "big-think" theories. He argues that the influential work on institutions and development by Acemoglu, Johnson, and Robinson, the winners of the 2024 Economics Nobel, while intriguing, lacks robust empirical validation. From his blog: The science prizes rely very heavily on external validity to determine who gets the prize -- your theory or your invention has to work, basically. If it doesn't, you can be the biggest genius in the world, but you'll never get a Nobel. The physicist Ed Witten won a Fields Medal, which is even harder to get than a Nobel, for the math he invented for string theory. But he'll almost certainly never get a Physics Nobel, because string theory can't be empirically tested.

The Econ Nobel is different. Traditionally, it's given to economists whose ideas are most influential within the economics profession. If a whole bunch of other economists do research that follows up on your research, or which uses theoretical or empirical techniques you pioneered, you get an Econ Nobel. Your theory doesn't have to be validated, your specific empirical findings can already have been overturned by the time the prize is awarded, but if you were influential, you get the prize.

You could argue that this is appropriate for what Thomas Kuhn would call a "pre-paradigmatic" science -- a field that's still looking for a set of basic concepts and tools. But it's been 55 years since they started giving the prize, and that seems like an awfully long time for a field to still be tooling up. Meanwhile, making "influence within the economics profession" the criterion for successful research seems a little too much like a popularity contest. It's how you end up with prizes like the one in 2004, which was given to some macroeconomic theorists whose theory said that recessions are caused by technological slowdowns and that mass unemployment is a voluntary vacation.

In recent years, that looked like it might be changing. Often, the prize was given to empirical economists associated with the so-called "credibility revolution" -- basically, quasi-experiments. Those cases include Goldin in 2023, Card/Angrist/Imbens in 2021, and Banerjee/Duflo/Kremer in 2019. And when it was given to theorists, they tended to be game theorists whose theories are very predictive of real-world outcomes -- Milgrom/Wilson in 2020, Hart/Holmstrom in 2016, Tirole in 2014, and Roth/Shapley in 2012. Even when the prize was given to macro -- a field where validity is much harder to establish -- it was given to economists whose theories have seen immediate application to pressing problems of the day, such as Bernanke/Diamond/Dybvig in 2022 and Nordhaus in 2018. In other words, the recent Nobels have made it seem like economics might be becoming more like a natural science, where practical applications and external validity are the ultimate arbiter of the value of research, rather than cultural influence within the economics profession. But this year's prize seems like a step away from that, and back toward the sort of big-think that used to be more popular in the prize's early years.

AI

Study Done By Apple AI Scientists Proves LLMs Have No Ability to Reason (appleinsider.com) 233

Slashdot reader Rick Schumann shared this report from the blog AppleInsider: A new paper from Apple's artificial intelligence scientists has found that engines based on large language models, such as those from Meta and OpenAI, still lack basic reasoning skills.

The group has proposed a new benchmark, GSM-Symbolic, to help others measure the reasoning capabilities of various large language models (LLMs). Their initial testing reveals that slight changes in the wording of queries can result in significantly different answers, undermining the reliability of the models. The group investigated the "fragility" of mathematical reasoning by adding contextual information to their queries that a human could understand, but which should not affect the fundamental mathematics of the solution. This resulted in varying answers, which shouldn't happen...

The study found that adding even a single sentence that appears to offer relevant information to a given math question can reduce the accuracy of the final answer by up to 65 percent. "There is just no way you can build reliable agents on this foundation, where changing a word or two in irrelevant ways or adding a few bits of irrelevant info can give you a different answer," the study concluded... "We found no evidence of formal reasoning in language models," the new study concluded. The behavior of LLMs "is better explained by sophisticated pattern matching" which the study found to be "so fragile, in fact, that [simply] changing names can alter results."
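
The perturbation idea behind the benchmark is easy to illustrate; a minimal sketch in the spirit of GSM-Symbolic, with an invented template (not drawn from the paper):

```python
# Generate variants of one word problem by swapping names and numbers;
# a model that truly reasons should stay accurate across all of them.
import random

TEMPLATE = ("{name} picks {n} apples on Monday and {m} more on Tuesday. "
            "How many apples does {name} have now?")
NAMES = ["Sophie", "Liam", "Mei", "Omar"]

def make_variant(rng: random.Random) -> tuple[str, int]:
    n, m = rng.randint(2, 40), rng.randint(2, 40)
    return TEMPLATE.format(name=rng.choice(NAMES), n=n, m=m), n + m

rng = random.Random(0)
for _ in range(3):
    question, answer = make_variant(rng)
    print(question, "->", answer)
```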

Math

Researchers Claim New Technique Slashes AI Energy Use By 95% (decrypt.co) 115

Researchers at BitEnergy AI, Inc. have developed Linear-Complexity Multiplication (L-Mul), a technique that reduces AI model power consumption by up to 95% by replacing energy-intensive floating-point multiplications with simpler integer additions. This method promises significant energy savings without compromising accuracy, but it requires specialized hardware to fully realize its benefits. Decrypt reports: L-Mul tackles the AI energy problem head-on by reimagining how AI models handle calculations. Instead of complex floating-point multiplications, L-Mul approximates these operations using integer additions. So, for example, instead of multiplying 123.45 by 67.89, L-Mul breaks it down into smaller, easier steps using addition. This makes the calculations faster and uses less energy, while still maintaining accuracy. The results seem promising. "Applying the L-Mul operation in tensor processing hardware can potentially reduce 95% energy cost by element wise floating point tensor multiplications and 80% energy cost of dot products," the researchers claim. Without getting overly complicated, what that means is simply this: If a model used this technique, it would require 95% less energy to think, and 80% less energy to come up with new ideas, according to this research.
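
The flavor of the trick can be sketched with the classic logarithmic-multiplication shortcut, in which adding the integer bit patterns of two floats approximates their product because the exponents add exactly; this Mitchell-style sketch is for illustration only and is not the paper's L-Mul kernel:

```python
# Approximate float multiply via one integer addition (Mitchell-style).
import struct

BIAS = 0x3F800000  # bit pattern of 1.0 as float32

def to_bits(x: float) -> int:
    return struct.unpack("<I", struct.pack("<f", x))[0]

def to_float(b: int) -> float:
    return struct.unpack("<f", struct.pack("<I", b & 0xFFFFFFFF))[0]

def approx_mul(x: float, y: float) -> float:
    """Approximate x * y for positive floats; worst-case error ~11%."""
    return to_float(to_bits(x) + to_bits(y) - BIAS)

print(approx_mul(123.45, 67.89))  # close to, but not exactly, the true product
print(123.45 * 67.89)             # 8381.0205...
```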

The algorithm's impact extends beyond energy savings. L-Mul outperforms current 8-bit standards in some cases, achieving higher precision while using significantly less bit-level computation. Tests across natural language processing, vision tasks, and symbolic reasoning showed an average performance drop of just 0.07% -- a negligible tradeoff for the potential energy savings. Transformer-based models, the backbone of large language models like GPT, could benefit greatly from L-Mul. The algorithm seamlessly integrates into the attention mechanism, a computationally intensive part of these models. Tests on popular models such as Llama, Mistral, and Gemma even revealed some accuracy gain on certain vision tasks.

At an operational level, L-Mul's advantages become even clearer. The research shows that multiplying two float8 numbers (the way AI models would operate today) requires 325 operations, while L-Mul uses only 157 -- less than half. "To summarize the error and complexity analysis, L-Mul is both more efficient and more accurate than fp8 multiplication," the study concludes. But nothing is perfect, and this technique has a major Achilles' heel: it requires a special type of hardware, so current hardware isn't optimized to take full advantage of it. Plans for specialized hardware that natively supports L-Mul calculations may already be in motion. "To unlock the full potential of our proposed method, we will implement the L-Mul and L-Matmul kernel algorithms on hardware level and develop programming APIs for high-level model design," the researchers say.

Space

Stephen Hawking Was Wrong - Extremal Black Holes Are Possible (quantamagazine.org) 44

"Even black holes have edge cases," writes Astronomy magazine contributing editor Steve Nadis, in an article in Quanta magazine (republished today by Wired): Black holes rotate in space. As matter falls into them, they start to spin faster; if that matter has charge, they also become electrically charged. In principle, a black hole can reach a point where it has as much charge or spin as it possibly can, given its mass. Such a black hole is called "extremal" — the extreme of the extremes. These black holes have some bizarre properties. In particular, the so-called surface gravity at the boundary, or event horizon, of such a black hole is zero. "It is a black hole whose surface doesn't attract things anymore," said Carsten Gundlach, a mathematical physicist at the University of Southampton. But if you were to nudge a particle slightly toward the black hole's center, it would be unable to escape.

In 1973, the prominent physicists Stephen Hawking, James Bardeen and Brandon Carter asserted that extremal black holes can't exist in the real world — that there is simply no plausible way that they can form. Nevertheless, for the past 50 years, extremal black holes have served as useful models in theoretical physics. "They have nice symmetries that make it easier to calculate things," said Gaurav Khanna of the University of Rhode Island, and this allows physicists to test theories about the mysterious relationship between quantum mechanics and gravity. Now two mathematicians have proved Hawking and his colleagues wrong. The new work — contained in a pair of recent papers by Christoph Kehle of the Massachusetts Institute of Technology and Ryan Unger of Stanford University and the University of California, Berkeley — demonstrates that there is nothing in our known laws of physics to prevent the formation of an extremal black hole.

Their mathematical proof is "beautiful, technically innovative and physically surprising," said Mihalis Dafermos, a mathematician at Princeton University (and Kehle's and Unger's doctoral adviser). It hints at a potentially richer and more varied universe in which "extremal black holes could be out there astrophysically," he added. That doesn't mean they are. "Just because a mathematical solution exists that has nice properties doesn't necessarily mean that nature will make use of it," Khanna said. "But if we somehow find one, that would really [make] us think about what we are missing." Such a discovery, he noted, has the potential to raise "some pretty radical kinds of questions." Before Kehle and Unger's proof, there was good reason to believe that extremal black holes couldn't exist.

Hawking, Bardeen, and Carter believed there was no way an extremal black hole could form, according to the article, and "in 1986, a physicist named Werner Israel seemed to put the issue to rest."

But the two mathematicians, studying the formation of electrically charged black holes, stumbled into a counterexample — and along the way "also constructed two other solutions to Einstein's equations of general relativity that involved different ways of adding charge to a black hole. Having disproved Bardeen, Carter and Hawking's hypothesis in three different contexts, the work should leave no doubt, Unger said... "This is a beautiful example of math giving back to physics," said Elena Giorgi, a mathematician at Columbia University....

In the meantime, a better understanding of extremal black holes can provide further insights into near-extremal black holes, which are thought to be plentiful in the universe. "Einstein didn't think that black holes could be real [because] they're just too weird," Khanna said. "But now we know the universe is teeming with black holes."

For similar reasons, he added, "we shouldn't give up on extremal black holes. I just don't want to put limits on nature's creativity."

AI

OpenAI Releases o1, Its First Model With 'Reasoning' Abilities 108

OpenAI has launched a new AI model, named "o1", designed for improved reasoning and problem-solving skills. o1, part of a new series of models and available in ChatGPT and the API, can tackle complex tasks in science, coding, and math more effectively than its predecessors. Notably, o1 models have shown promising results in standardized tests and coding competitions. While o1 models represent a significant advancement in AI capabilities, they currently lack features like web browsing and file uploading. The Verge adds: But it's also more expensive and slower to use than GPT-4o. OpenAI is calling this release of o1 a "preview" to emphasize how nascent it is.

ChatGPT Plus and Team users get access to both o1-preview and o1-mini starting today, while Enterprise and Edu users will get access early next week. OpenAI says it plans to bring o1-mini access to all the free users of ChatGPT but hasn't set a release date yet. Developer access to o1 is really expensive: In the API, o1-preview is $15 per 1 million input tokens, or chunks of text parsed by the model, and $60 per 1 million output tokens. For comparison, GPT-4o costs $5 per 1 million input tokens and $15 per 1 million output tokens.
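
At those prices, the cost gap is easy to quantify; a quick sketch using the per-token rates quoted above (the workload numbers are invented for illustration):

```python
# USD per 1M input/output tokens, as quoted above.
PRICES = {"o1-preview": (15, 60), "gpt-4o": (5, 15)}

def cost(model: str, input_tokens: int, output_tokens: int) -> float:
    rate_in, rate_out = PRICES[model]
    return (input_tokens * rate_in + output_tokens * rate_out) / 1_000_000

# Example workload: 2M input tokens, 0.5M output tokens.
for model in PRICES:
    print(f"{model}: ${cost(model, 2_000_000, 500_000):.2f}")
# o1-preview: $60.00 vs. gpt-4o: $17.50
```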

The training behind o1 is fundamentally different from its predecessors, OpenAI's research lead, Jerry Tworek, tells me, though the company is being vague about the exact details. He says o1 "has been trained using a completely new optimization algorithm and a new training dataset specifically tailored for it."
Education

College Grades Have Become a Charade. It's Time To Abolish Them. (msn.com) 234

When most students get As, grading loses all meaning as a way to encourage exceptional work and recognize excellence. From a report: Grade inflation at American universities is out of control. The statistics speak for themselves. In 1950, the average GPA at Harvard was estimated at 2.6 out of 4. By 2003, it had risen to 3.4. Today, it stands at 3.8. The more elite the college, the more lenient the standards. At Yale, for example, 80% of grades awarded in 2023 were As or A minuses. But the problem is also prevalent at less selective colleges. Across all four-year colleges in the U.S., the most commonly awarded grade is now an A. Some professors and departments, especially in STEM disciplines, have managed to uphold more stringent criteria. A few advanced courses attract such a self-selecting cohort of students that virtually all of them deserve recognition for genuinely excellent work. But for the most part, the grading scheme at many institutions has effectively become useless. An A has stopped being a mark of special academic achievement.

If everyone outside hard-core engineering, math or pre-med courses can easily get an A, the whole system loses meaning. It fails to make distinctions between different levels of achievement or to motivate students to work hard on their academic pursuits. All the while, it allows students to pretend -- to themselves and to others -- that they are performing exceptionally well. Worse, this system creates perverse incentives. To name but one, it actively punishes those who take risks by enrolling in truly challenging courses. All of this contributes to the strikingly poor record of American colleges in actually educating their students. As Richard Arum and Josipa Roksa showed in their 2011 book "Academically Adrift," the time that the average full-time college student spent studying dropped by half in the five decades after 1960, falling to about a dozen hours a week. A clear majority of college students "showed no significant progress on tests of critical thinking, complex reasoning and writing," with about half failing to make any improvements at all in their first two years of higher education.
