Big Mac Benchmark Drops to 7.4 TFlops 417

Posted by CowboyNeal on Wednesday October 22, 2003 @03:09PM from the number-adjusting dept.

coolmacdude writes "Well it seems that the early estimates were a bit overzealous. According to preliminary test results (in postscript format) on the full range of CPUs at Virginia Tech, the Rmax score on Linpack comes in at around 7.4 TFlops. This puts it at number four on the Top 500 List. It also represents an efficiency of about 44 percent, down from the previous result of 80 achieved on a subset of the computers. Perhaps in light of this, apparantly VT is now planning to devote an additional two months to improve the stability and efficiency of the system before any research can begin. While these numbers will no doubt come as a disappointment for Mac zealots who wanted to blow away all the Intel machines, it should still be noted that this is the best price/performance ratio ever achieved on a supercomputer. In addition, the project was successful at meeting VT's goal of developing an inexpensive top 5 machine. The results have also been posted at Ars Technica's openforum."

This discussion has been archived. No new comments can be posted.

Big Mac Benchmark Drops to 7.4 TFlops

Load All Comments

Search 417 Comments Log In/Create an Account

Comments Filter:

A supercomputer by Any Other Name.... (Score:5, Interesting)

by bluethundr ( 562578 ) * writes: on Wednesday October 22, 2003 @03:09PM (#7283258) Homepage Journal

I've always been sort of intrigued by ,a href="http://www.top500.org/">Top500. Has there ever been a good comparison written about the similarities/differences between a 'supercomputer' and the lowly pc sitting on my desk running Linux/XP? At what point does the computer in question earn the title "Super"?

Share
twitter facebook
- My computer is SUPER!!! (Score:2, Funny)
  
  by Anonymous Coward writes: on Wednesday October 22, 2003 @03:16PM (#7283332)
  
  Thanks for asking!!
  
  Parent Share
  twitter facebook
- Re:A supercomputer by Any Other Name.... (Score:3, Insightful)
  
  by Carnildo ( 712617 ) writes: on Wednesday October 22, 2003 @03:24PM (#7283425) Homepage Journal
  
  The big difference is that a "supercomputer" is usually heavily optimized towards vector operations: performing the same operation on many data elements at once. Think of it as SIMD (MMX, SSE, etc), only more so. A "supercomputer" would be pretty useless at ordinary tasks such as web browsing or word processing, as those can't be vectorized or parallelized very well. A "supercomputer" might be good as a graphics or physics engine for gaming, but that's sort of like using a cannon to swat a fly: a lot of work for something that can be done with a simple flyswatter.
  
  Parent Share
  twitter facebook
  - - Re:A supercomputer by Any Other Name.... (Score:3, Funny)
      
      by cosmo7 ( 325616 ) writes: on Wednesday October 22, 2003 @06:36PM (#7285301) Homepage
      
      supercomputer pronunciation key(spr-km-pytr)
      n.
      A mainframe computer that, as the result of birth on an alien planet, is impervious to bullets, is capable of flight, has x-ray vision, can run faster than a speeding train, etc.
      
      "Is it a bird? Is it a plane? No it's a Cray XM-P!"
      - Seymour Fights The Demon World, Action Comics, 1932
      
      Source: The American Heritage(R) Dictionary of the English Language, Fourth Edition
      Copyright (C) 2000 by Houghton Mifflin Company.
      Published by Houghton Mifflin Company. All rights reserved.
      
      Parent Share
      twitter facebook
- Re:A supercomputer by Any Other Name.... (Score:4, Funny)
  
  by BostonPilot ( 671668 ) writes: on Wednesday October 22, 2003 @03:32PM (#7283496) Homepage
  
  Nah, the real defintions is:
  Super computers cost more than 5 million dollars
  Mainframes cost more than 1 million dollars
  Mini-Super computers cost more than 1/4 million dollars
  Everything else is by definition a Plain Jane (TM) computer
  btw, I've worked on all 4 kinds ;-)
  
  Parent Share
  twitter facebook
- From the horse's mouth (Score:5, Interesting)
  
  by Jungle guy ( 567570 ) writes: <brunolmailbox-ge ... .br minus author> on Wednesday October 22, 2003 @03:56PM (#7283718) Journal
  
  Jack Dongarra says that a "supercomputer" is simply a computer that, for todays's standards, is REALLY fast. I saw one presentation from him, and he said he run the Linpack benchmark on his notebook (2.4 GHz Pentium 4) and it would get to the bottom of the Top500 list in 1992. So, this supercomputer definition is very fluid.
  
  Parent Share
  twitter facebook
snazzy new G5 logo too! (Score:4, Funny)

by JUSTONEMORELATTE ( 584508 ) writes: on Wednesday October 22, 2003 @03:10PM (#7283270) Homepage

Way to go /. -- updated the logo from G4 to G5 just in time.

--

Share
twitter facebook
- Re:snazzy new G5 logo too! (Score:2, Funny)
  
  by t0ny ( 590331 ) writes: on Wednesday October 22, 2003 @04:50PM (#7284303)
  
  While these numbers will no doubt come as a disappointment for Mac zealots who wanted to blow away all the Intel machines, it should still be noted that this is the best price/performance ratio ever achieved on a supercomputer.
  Way to go there; lets just keep encouraging their terrorism.
  
  Parent Share
  twitter facebook
Important items of note (Score:5, Informative)

by daveschroeder ( 516195 ) * writes: on Wednesday October 22, 2003 @03:10PM (#7283274)

It's worth noting a few important things:

First, from a an Oct 22 New York Times [nytimes.com] story:

Officials at the school said that they were still finalizing their results and that the final speed number might be significantly higher.

This will likely be the case.

Second, they're only 0.224 Tflops away from the only Intel-based cluster above it. So saying "all the Intel machines" in the story is kind of inaccurate, as if there are all kinds of Intel-based clusters that will still be faster; there is only one Intel-based cluster above it, and with only preliminary numbers for the Virgina Tech cluster at that.

Third, this figure is with around 2112 processors, not the full 2200 processors. With all 1100 nodes, even with no efficiency gain, it will be number 3, as-is.

Finally, this is the a cluster of several firsts:

First major cluster with PowerPC 970
First major cluster with Apple hardware
First major cluster with Infiniband
First major cluster with Mac OS X (Yes, it is running Mac OS X 10.2.7, NOT Linux or Panther [yet])

Linux on Intel has been at this for years. This cluster was assembled in 3 months. There is no reason for the Virginia Tech cluster to remain at ~40% efficiency. It is more than reasonable to expect higher than 50%.

It's still destined for number 3, and its performance will likely even climb for the next Top 500 list as the cluster is optimized. The final results will not be officially announced until a session on November 18 at Supercomputing 2003.

Share
twitter facebook
- Re:Important items of note (Score:2, Interesting)
  
  by Ianoo ( 711633 ) writes: on Wednesday October 22, 2003 @03:13PM (#7283308) Journal
  
  I wonder how dual Xeon boxes would do using Infiniband? Probably a lot better than they're doing at the moment.
  
  Parent Share
  twitter facebook
- Re:Important items of note (Score:2, Interesting)
  
  by Evil Adrian ( 253301 ) writes: on Wednesday October 22, 2003 @03:18PM (#7283357) Homepage
  
  Officials at the school said that they were still finalizing their results and that the final speed number might be significantly higher.
  
  This will likely be the case.
  
  Why is this likely? The number dropped, why is it more likely to go up rather than down (or nowhere, for that matter)?
  
  Parent Share
  twitter facebook
  - Actually, it's already at 8.2 Tflop today (Oct 22) (Score:3, Informative)
    
    by daveschroeder ( 516195 ) * writes: on Wednesday October 22, 2003 @03:21PM (#7283397)
    
    So, yes, these numbers are preliminary, and yes, they WILL increase - they already are. See http://www.netlib.org/benchmark/performance.pdf (the official source of preliminary numbers), page 53.
    
    Parent Share
    twitter facebook
  - Re:Important items of note (Score:5, Informative)
    
    by Carnildo ( 712617 ) writes: on Wednesday October 22, 2003 @03:26PM (#7283449) Homepage Journal
    
    The number dropped because they used a better benchmark (testing all the nodes, rather than a subset). It'll probably go up because now they'll be able to tune the system to get around bottlenecks.
    
    Parent Share
    twitter facebook
- Re:Important items of note (Score:5, Insightful)
  
  by Durinia ( 72612 ) * writes: on Wednesday October 22, 2003 @03:21PM (#7283401)
  
  On the other side of the issue is that it places 4th in the current Top 500 list, which was released in June. We won't really know where it places on this "moving target" until the next list is released in November.
  
  Parent Share
  twitter facebook
  - Not really (Score:5, Informative)
    
    by daveschroeder ( 516195 ) * writes: on Wednesday October 22, 2003 @03:26PM (#7283442)
    
    The preliminary performance report at http://www.netlib.org/benchmark/performance.pdf contains the new entries for the upcoming list as well (see page 53).
    
    Parent Share
    twitter facebook
  - Re:Important items of note (Score:3, Informative)
    
    by mz001b ( 122709 ) writes: on Wednesday October 22, 2003 @04:29PM (#7284100)
    
    On the other side of the issue is that it places 4th in the current Top 500 list, which was released in June. We won't really know where it places on this "moving target" until the next list is released in November.
    The deadline for submission to the Nov 2003 Top 500 list was Oct. 1st (see call for proposals) [top500.org], so it has already passed. Any further improvements that they make to the scalability of the cluster should not be included. This is true for all the machines.
    
    Parent Share
    twitter facebook
- Also Important? (Score:3, Informative)
  
  by ThosLives ( 686517 ) writes: on Wednesday October 22, 2003 @03:34PM (#7283514) Journal
  
  If you read the fine print, the Nmax for the G5 was 100,000 higher than for the Linux cluster. Now, that's kind of interesting, because the G5 cluster was then only slightly slower doing a much bigger (450,000 Nmax vs 350,000 Nmax on the Xeons) problem. I wonder why they don't somehow scale the FLOPs to reflect this fact.
  Anyone know how much merit there is to using Nmax (or N1/2) to compare different systems?
  
  Parent Share
  twitter facebook
- Re:Important items of note (Score:2)
  
  by Oculus Habent ( 562837 ) * writes: <.moc.liamg. .ta. .tnebah.suluco.> on Wednesday October 22, 2003 @04:10PM (#7283864) Journal
  
  My (completely untrained) guess is they are dealing with network saturation. The computers themselves don't get slower because there are more of them, so...
  
  Could they add NICs to each computer, bond them (probably need to write something for this), and set up parallel networks with each set of cards to improve bandwidth?
  
  Don't enough about the cluster's setup to say much at this point.
  
  Parent Share
  twitter facebook
  - Re:Important items of note (Score:2)
    
    by Hoser McMoose ( 202552 ) writes: on Wednesday October 22, 2003 @04:31PM (#7284120)
    
    No they could not bond NICs, because they're using Infiniband and not ethernet. Besides, I think that they are being limited more by latency than bandwidth, so therefore adding bandwidth isn't going to help much. What's worse, their bandwidth limit is being reached inside the computer, with their chip to chip interconnect having less bandwidth than their computer to computer interconnect.
    
    This is not altogether surprising, given that they are using a desktop computer and trying to shoehorn it into a supercomputer role. They are bound to run into some limitations.
    
    Parent Share
    twitter facebook
- Re:Important items of note (Score:2)
  
  by coolmacdude ( 640605 ) writes: on Wednesday October 22, 2003 @04:32PM (#7284125) Homepage Journal
  
  Second, they're only 0.224 Tflops away from the only Intel-based cluster above it.
  So saying "all the Intel machines" in the story is kind of inaccurate
  
  I was trying to refer to the fact that sometimes the Mac zealots, in the midst of their zealotry,
  lose sight of reality and simply lump all non-Mac related things into one huge category, even if it really isn't one.
  
  Parent Share
  twitter facebook
- - AltiVec won't help here (Score:5, Informative)
    
    by Troy Baer ( 1395 ) writes: on Wednesday October 22, 2003 @04:40PM (#7284204) Homepage
    
    The Linpack benchmark, as compiled to the G5, is not utilizing the processor to its fullest. The school is still in the process of adding Altivec compiler optimizations, which should drastically improve the results.
    
    The AltiVec instructions support only single precision (32-bit) floating point operations, and the core routine in the Parallel Linpack Benchmark is DGEMM() which is double precision (64-bit). The G5 already has two double precision FPUs, each of which can do a multiply/add op every clock cycle.
    My feeling is that the ~40% efficiency seen on the larger scale run is an indication that either VA Tech spent very little time tuning the problem size or they didn't design their InfiniBand fabric to really handle 1100 nodes hammering away at Parallel Linpack. (Given that they've been extremely vague about how their IB network is structured, I fear it may be the latter.)
    
    Right now, the processor is behaving essentially as a G4 with a bigger fan and more memory addresses. Rumor has it that tweeking the compiler to abuse the Altivec unit may push the system above the theoretical limit in some calculations.
    
    I doubt that's true, especially if they're using the IBM PPC compilers. The G4 has both significantly less memory bandwidth and a single double-precision-capable FPU, whereas the G5 is basically a single-core Power4 with an AltiVec unit in place of some cache. IBM's compilers (despite being a little wonky as far as naming and argument syntax) generally produce pretty fast code.
    --Troy
    
    Parent Share
    twitter facebook
the REAL reason to build a top-5 supercomputer (Score:5, Funny)

by Anonymous Coward writes: on Wednesday October 22, 2003 @03:11PM (#7283284)

What they're not telling you is that the real reason they are building a supercomputer is because the only copy of the router passwords is GPG-encrypted, and they lost the key.

Share
twitter facebook
- - Re:the REAL reason to build a top-5 supercomputer (Score:2)
    
    by Ungrounded Lightning ( 62228 ) writes: on Wednesday October 22, 2003 @04:03PM (#7283777) Journal
    
    yes, much faster to build, optimize, and crack a key-based crypo scheme than to rewrite the tables in a router on a large college
    
    Not quite. But you still need a supercomputer even to rewrite the tables. Especially after you install the supercomputer. B-)
    
    [/tongue-in-cheek]
    
    Parent Share
    twitter facebook
Too good to be true... (Score:5, Insightful)

by mrtroy ( 640746 ) writes: on Wednesday October 22, 2003 @03:12PM (#7283289)

That 80% efficiency simply sounded too good to be true, and it was.

Now its at 44%. Thats not a small drop, thats a MASSIVE drop.

They didnt predict any loss in going from a small subset to the whole system? Or was it a publicity stunt (we can outperform everyone! our names are __________!)

Share
twitter facebook
- PARENT IS NOT A TROLL (Score:2)
  
  by w42w42 ( 538630 ) writes: on Wednesday October 22, 2003 @03:54PM (#7283700)
  
  No troll here.
  
  Parent Share
  twitter facebook
- new updated troll (Score:3, Funny)
  
  by McAddress ( 673660 ) writes: on Wednesday October 22, 2003 @06:47PM (#7285404)
  
  I have been sitting here by my 1100 node G5 cluster trying to copy a 17.6 MB file for the last 20 minutes. It is so freaking slow now that I only get 44% efficiency. On my 1.5 Ghz P3 I would be able to do this in under 20 seconds. .....
  
  Parent Share
  twitter facebook
Big mac cluster.. (Score:5, Funny)

by jandrese ( 485 ) * writes: <kensama@vt.edu> on Wednesday October 22, 2003 @03:12PM (#7283295) Homepage Journal

That's nothing, last time I benchmarked my Big Mac Cluster (100 Big Macs) it came to almost 57.6 megacalories. Those Apples will never be able to match that!

Share
twitter facebook
- Re:Big mac cluster.. (Score:2)
  
  by Jeff DeMaagd ( 2015 ) writes: on Wednesday October 22, 2003 @03:22PM (#7283410) Homepage Journal
  
  Calorie is a unit of heat transfer, 1 Cal (uppercase C) is the amount of heat required to raise the temperature of 1g of water 1 degree celsius. Lower case calorie is 1/100th that.
  
  Parent Share
  twitter facebook
  - Re:Big mac cluster.. (Score:3, Funny)
    
    by Frymaster ( 171343 ) writes: on Wednesday October 22, 2003 @03:33PM (#7283510) Homepage Journal
    
    1 Cal (uppercase C) is the amount of heat required to raise the temperature of 1g of water 1 degree celsius
    which brings up a totally off topic question.... a can of coke is 350 ml. it contains 300 calories.
    now, let's say i drink this coke. it is really cold - say 4 degrees. my body temperature is a nice, mamallish 37 degrees. by drinking this coke i am warming up 350 g of what is essentially water from the temperature of the can to that of my body - a difference of 33 degress.
    33c * 350ml = 11550 calories.
    since the coke is only 300ish calories in the first place...
    why don't i lose weight drinking ice cold coke?
    
    Parent Share
    twitter facebook
    - Re:Big mac cluster.. (Score:5, Funny)
      
      by cK-Gunslinger ( 443452 ) writes: on Wednesday October 22, 2003 @03:41PM (#7283585) Journal
      
      I find your ideas intriguing and would like to subscribe to your newsletter.
      
      Parent Share
      twitter facebook
    - Re:Big mac cluster.. (Score:4, Informative)
      
      by zulux ( 112259 ) writes: on Wednesday October 22, 2003 @03:43PM (#7283598) Homepage Journal
      
      since the coke is only 300ish calories in the first place...
      
      For consumers, food calories are really kilo-calories. So in this case, you coke has 300,000 physic-style calories.
      
      If you look at a euopean food-labels, sometime you can seem them writen as kcal.
      
      Parent Share
      twitter facebook
      - Re:Big mac cluster.. (Score:3, Informative)
        
        by lostchicken ( 226656 ) writes: on Wednesday October 22, 2003 @05:46PM (#7284858)
        
        A different unit, though. 1 kcal = 4.187 kilojoules. (1 calorie (not kcal) = energy to raise 1 gramme of water one degree c, 1 joule is the work done in countering one newton of force for one meter.)
        
        Parent Share
        twitter facebook
    - Re:Big mac cluster.. (Score:2)
      
      by gordyf ( 23004 ) writes: on Wednesday October 22, 2003 @03:47PM (#7283647)
      
      I've wondered that myself. It would seem that drinking cold water (or ice!) would be an excellent way to lose weight, but it doesn't seem to be that way...
      
      Parent Share
      twitter facebook
      - Re:Big mac cluster.. (Score:2)
        
        by jandrese ( 485 ) * writes: <kensama@vt.edu> on Wednesday October 22, 2003 @04:41PM (#7284212) Homepage Journal
        
        On the other hand, if you only have a few pounds of fat to remove, if you're already in lean condition, or if you just want to give Superhydration an informal trial for whatever reason, here are the most efficient guidelines to utilize:
        
        1) Purchase a 32-ounce, insulated, plastic bottle from which to sip your water.
        
        2) Start by sipping one gallon, or 128 ounces, of water a day. Do not go higher than 128 ounces per day for this informal trial period.
        
        3) Drink most of the water before 5pm.
        
        4) Keep the water ice cold. Remember, each ounce of 40 degrees Fahrenheit water requires approximately 1 calorie to warm it to a core body temperature of 98.6.
        
        5) Apply the above recommendations for at least 14 days.
        
        Wow, so if I spent all day drinking 128oz of ~3 degree C water, I could burn a whole 128 kcals? That's a little more than a single apple. And I get to spend half of the day looking for the nearest bathroom? I think actual exercise would be less effort and more effective, unless you're looking to exercise your bladder muscles.
        
        Parent Share
        twitter facebook
    - Re:Big mac cluster.. (Score:5, Informative)
      
      by Graff ( 532189 ) writes: on Wednesday October 22, 2003 @04:06PM (#7283809)
      
      The original poster was wrong when he said:
      
      1 Cal (uppercase C) is the amount of heat required to raise the temperature of 1g of water 1 degree celsius
      
      A Calorie (the one used on food labels) is actually a kilocalorie. A Calorie is therefore 1000 calories. 1 calorie is basically the amount of heat needed to raise 1g of water 1 degree celsius. (A calorie is actually 1/100 of amount of heat needed to get 1 gram of water from 0 degrees C to 100 degrees C, but that works out almost the same.)
      
      This is explained a bit on this web page. [reference.com]
      
      So warming a 4 degrees C, 350mL Coke to 37 degrees C would take (37 - 4) * 350 = 11550 calories. This is 11.55 kilocalories or 11.55 Calories. The Coke has around 300 Calories in nutritive value therefore you would gain 300 - 11.55 = 288.45 Calories of energy from a 4 degrees C, 350mL can of Coke.
      
      Parent Share
      twitter facebook
  - Re:Big mac cluster.. (Score:2)
    
    by hondo77 ( 324058 ) writes: on Wednesday October 22, 2003 @03:33PM (#7283512) Homepage
    
    Our thanks to the Anal Retentive Chef [goodeatsfanpage.com] for his guest editorial.
    
    Parent Share
    twitter facebook
  - Re:Big mac cluster.. (Score:2)
    
    by gerardrj ( 207690 ) * writes: on Wednesday October 22, 2003 @04:11PM (#7283881) Journal
    
    It's not a measure of heat transfer, it's a measure of energy. You could measure the output of an automobile engine in calories if you like. Convert calories to watts to HP to torque(more or less) to thrust, it's all a different scale of the same thing.
    
    Parent Share
    twitter facebook
- Re:Big mac cluster.. (Score:2, Funny)
  
  by mccrew ( 62494 ) writes: on Wednesday October 22, 2003 @03:29PM (#7283475)
  
  ...my Big Mac Cluster...
  Um, yeah, could I get some fries with that?
  
  Parent Share
  twitter facebook
  - Re:Big mac cluster.. (Score:2)
    
    by Valdrax ( 32670 ) writes: on Wednesday October 22, 2003 @04:22PM (#7284019)
    
    No, but you could get it at Fry's! [frys.com]
    
    Parent Share
    twitter facebook
- Re:Big mac cluster.. (Score:2)
  
  by 3Suns ( 250606 ) writes: on Wednesday October 22, 2003 @03:46PM (#7283636) Homepage
  
  Seeing as a large apple is about 100 kilocalories, you'd need a cluster of maybe 580 apples to best your Big Mac Cluster. If you go to an apple orchard I'm sure you could find a better price-performance ratio with apples than you could with Big Macs at McDonalds. Plus, most orchards will probably let you gather virtually unlimited quantities of fallen apples for free.
  
  Parent Share
  twitter facebook
Instant Numbers... (Score:3, Insightful)

by Dracolytch ( 714699 ) writes: on Wednesday October 22, 2003 @03:12PM (#7283297) Homepage

Not terribly surprising. Much like estimated death tolls for disasters, never believe the first set of benchmarks for a computer. Wait until thorough testing can be done before you start believing the numbers.

Y'all should know this by now. ;)
~D

Share
twitter facebook
Does anyone else have trouble reconciling... (Score:5, Funny)

by ikewillis ( 586793 ) writes: on Wednesday October 22, 2003 @03:14PM (#7283313) Homepage

"best price performance" and "Apple" in their minds?

Share
twitter facebook
- Re:Does anyone else have trouble reconciling... (Score:3, Insightful)
  
  by Experiment 626 ( 698257 ) writes: on Wednesday October 22, 2003 @05:10PM (#7284539)
  
  While some people have given the parent a flamebait mod and hostile replies, the poster makes a good (and humorous) point. Apple is not typically thought of in terms best price performance any more than, say, Cadillac is in the car industry. Macs are bought by those willing to pay a premium for that distinct Apple stying, OSX's slick interface with the power of Unix behind the scenes, the "it just works" factor, and so on. Those who don't care about the amenities and just want bang for the buck go for a Dell or eMachines or whatever. I personally find it quite interesting that a company whose image is more luxury than value and whose products are so much newer in this field than the Linux Beowulf clusters is able to achieve such an impressive level of performance for the cost.
  
  Parent Share
  twitter facebook
- It's a good price/performance, but not best. (Score:3, Interesting)
  
  by tmattox ( 197485 ) writes: <tmattox.ieee@org> on Wednesday October 22, 2003 @11:31PM (#7287484) Homepage
  
  I guess the original submission didn't see the slashdot article [slashdot.org] from August 23 about our KASY0 [aggregate.org] supercomputer breaking the $100 per GFLOPS barrier.
  KASY0 achieved 187.3 GFLOPS on the 64-bit floating point version of HPL, the same benchmark used on "Big Mac". While "Big Mac" is about 40 times faster on that benchmark, it is about 130 times the cost of KASY0 (~$40K vs ~$5200K). Considering the size difference, "Big Mac" is VERY impressive, but it can't claim to be the best price/performance supercomputer on the HPL benchmark.
  Note: KASY0 gets 482.6 GFLOPS (0.48 TFLOPS) on a 32-bit precision version of Linpack, satisfying our under $100 per GFLOPS claim [aggregate.org].
  Regardless, Virginia Tech's "Big Mac" is a very impressive machine. My congratulations to them!
  
  Parent Share
  twitter facebook
Catch Phrase (Score:4, Funny)

by humpTdance ( 666118 ) writes: on Wednesday October 22, 2003 @03:14PM (#7283314)

Best Price/Performance ratio = promotional video with the phrase:
"Virginia Tech: Home of the Poor Man's Supercomputer and Michael Vick."

Share
twitter facebook
This is NOT all that surprising. (Score:5, Insightful)

by dbirchall ( 191839 ) writes: on Wednesday October 22, 2003 @03:16PM (#7283331) Journal

A single G5 FPU (each CPU has 2) can do 1 64-bit (double precision) FLOPs per cycle, or 2 if and only if those two are a MULTIPLY and an ADD.
Apparently there are a lot of cases where a MULTIPLY and an ADD do come together like that, but I'm not surprised if LINPACK doesn't consist entirely of those pairs. ;)
The 17.6 TFLOP theoretical peak assumed a perfect case consisting entirely of MULTIPLY-ADD pairs. In a case assuming no MULTIPLY-ADD pairs, the theoretical peak is 8.8 TFLOPs.
7.4 TFLOPs is only 42% of 17.6 TFLOPs, but it's 84% of 8.8 TFLOPs. I suspect the actual "efficiency" of the machine lies somewhere in the middle.
(As for me, I'm happy with just ONE dualie...)

Share
twitter facebook
- Re:This is NOT all that surprising. (Score:2, Informative)
  
  by humpTdance ( 666118 ) writes: on Wednesday October 22, 2003 @03:30PM (#7283481)
  
  Until these applications are written in 64 bit code, it won't matter. Smeagol and Panther will still have to cross that bridge so old utilization rates will continue to apply.
  From: http://www.theregister.co.uk/content/39/31995.html [slashdot.org]
  
  The PowerPC architecture was always defined as a true 64-bit environment with 32-bit operation defined as a sub-set of that environment and a 32/64-bit 'bridge', as used by the 970, to "facilitate the migration of operating systems from 32-bit processor designs to 64-bit processors".
  The 'bridge' technology essentially allows the 970 to host 32-bit operating systems and apps that have been modified to support 64-bit addresses and larger files sizes as both Smeagol and Panther have. Adding 64-bit address support to existing applications lies at the heart of the optimisations for the Power Mac G5 that Apple suggests developers make.
  
  Parent Share
  twitter facebook
  - Re:This is NOT all that surprising. (Score:2)
    
    by Hoser McMoose ( 202552 ) writes: on Wednesday October 22, 2003 @04:09PM (#7283849)
    
    Uhh, you really don't know what you're talking about here do you? We're talking floating point code here, not integer code! You don't need Smeagol or Panther or any other cat to get 64-bit floating point code, DOS can handle that just fine!
    
    Essentially ALL processors with a floating point unit do 64-bit precision calculations. The old G4 and G3 did, the Pentium 4 does, the old 486 did, etc. etc. The whole 32-bit vs. 64-bit argument with these PowerPC 970 chips (and, in a similarly light, AMD64 chips) has to do with INTEGER registers and, more importantly, the size of pointers and address registers.
    
    That being said, the original parent probably missed something to. Supercomputers tend to do tasks that are easily vectorized, so therefore it's almost certain that the calculations that they were using were done using Altivec and not the standard floating point unit.
    
    Parent Share
    twitter facebook
- Re:This is NOT all that surprising. (Score:5, Informative)
  
  by hackstraw ( 262471 ) * writes: on Wednesday October 22, 2003 @03:36PM (#7283544)
  
  FWIW here are the efficiencies for the top 10 on www.top500.org:
  
  87.5 NEC Earth-Simulator
  67.8 Hewlett-Packard ASCI Q
  69.0 Linux Networx MCR Linux Cluster Xeon
  59.4 IBM ASCI White
  73.2 IBM SP Power3
  71.5 IBM xSeries Cluster
  45.1 Fujitsu PRIMEPOWER HPC2500
  79.2 Hewlett-Packard rx2600
  72.0 Hewlett-Packard AlphaServer SC
  77.7 Hewlett-Packard AlphaServer SC
  
  Parent Share
  twitter facebook
- Re:This is NOT all that surprising. (Score:2)
  
  by bnenning ( 58349 ) writes: on Wednesday October 22, 2003 @04:08PM (#7283833)
  
  Correct. Also note that one of the strengths of the G5 (and G4) is its vector units, which (afaik) can't be used for Linpack, because of the 64-bit precision requirements. For jobs that can use Altivec, the performance should be substantially better.
  
  Parent Share
  twitter facebook
- - - Re:I/O bandwidth and latency (Score:3, Informative)
      
      by Knobby ( 71829 ) writes: on Wednesday October 22, 2003 @05:09PM (#7284535)
      
      Grumble... Go take a look at Apple's description of the G5 architecture [apple.com] before spouting.. Here's the relevant lines:
      
      Each PowerPC G5 processor has its own dedicated 1GHz bidirectional interface to the system controller for a mind-boggling 16GB per second of total bandwidth -- more than twice the 6.4-GBps maximum bandwidth of Pentium 4-based systems using the latest PC architecture
      
      800MHz HyperTransport interconnects for a maximum throughput of 3.2GB per second.
      
      Apple uses the same basic memory set-up as the AMD Opteron.
      
      Parent Share
      twitter facebook
      - Re:I/O bandwidth and latency (Score:3, Informative)
        
        by Hoser McMoose ( 202552 ) writes: on Wednesday October 22, 2003 @10:30PM (#7287114)
        
        Err, Apple's G5 and the AMD Opteron don't have an even remotely related memory setup. The G5 looks a lot more like the AthlonXP and AthlonMP setups. The Opteron has an integrated 128-bit wide DDR memory controller, connects multiple CPUs directly through cache-coherent Hyptertransport links, and uses additional 32-bit, 1600MT/s HT links (3.2GB/s in each direction) to connect the CPU directly to the I/O chips.
        
        The Powermac G5 uses up to 1GT/s, 64-bit wide version of IBM's Elastic I/O bus to connect each processor to a memory controller chip, which in turn has a pair of 64-bit wide DDR memory controllers. These buses are also shared for the processors I/O needs, which are passed over a 800MT/s, 16-bit wide hypertransport link to the PCI-X controller.
        
        As for the width and speed of the Hypertransport links, Apple is very confusing on this front. In the document you linked they say "two bidirectional 16-bit, 800MHz HyperTransport interconnects for a maximum throughput of 3.2GB per second." In their PowerMac G5 Tech Specs PDF they say "two bidirectional 800MHz HyperTransport interconnects for a maximum throughput of 1.6 GBps." So which is it? And just what bandwidth are they measuring?
        
        The PowerMac does indeed have two separate bi-directional Hypertransport links, the first connects the memory and processor controller chip to the PCI-X controller, and the second goes from the PCI-X controller to the extra I/O chips. It seems to me like the page you quoted is ADDING the bandwidth of the two daisy-chained hypertransport links, which would be TOTALLY incorrect.
        
        My numbers came from the fact that a 16-bit (8-bits per direction) 800MT/s hypertransport link gets you only 800MB/s in each direction. Of course, it could really indeed be a "800MHz" hypertransport link, ie a 1600MT/s link since Hypertransport is a DDR protocol, but I highly doubt that since every other specification they mention just doubles the "MHz" number anytime they encounter a DDR bus (not that Apple is the only one to do this, Intel's "800MHz" bus runs at either 200MHz or 400MHz, depending on which clock you look at).
        
        Parent Share
        twitter facebook
The Mac cluster is still on top per CPU (Score:5, Interesting)

by BWJones ( 18351 ) writes: on Wednesday October 22, 2003 @03:19PM (#7283363) Homepage Journal

While these numbers will no doubt come as a disappointment for Mac zealots who wanted to blow away all the Intel machines, it should still be noted that this is the best price/performance ratio ever achieved on a supercomputer.

It still bests all other Intel hardware with only the Alpha hardware on top. And given the CPU count, even the Alpha hardware does not match it. Look at the numbers.....The Linux based 2.4Ghz cluster has almost 200 more CPU's on board with a 217 Gflop/sec difference. The Alpha clusters are running anywhere from 1,984 to 6,048 more CPU's.

Share
twitter facebook
- Re:The Mac cluster is still on top per CPU (Score:2)
  
  by Durinia ( 72612 ) * writes: on Wednesday October 22, 2003 @04:00PM (#7283747)
  
  The Mac cluster is still on top per CPU
  From the same document the Mac proponents have been quoting from: Dondarra Doc [netlib.org]
  Table 3 - page 53:
  Big Mac -> Rmax: 8164 Processors: 1936
  Cray X1 -> Rmax: 2932.9 Processors: 252
  Please be careful when making general statements. Thank you.
  That said, yes, it has the highest per CPU performance of the machines with commodity processors. (that are listed, at least - including the year-old Xeons)
  
  Parent Share
  twitter facebook
  - Re:The Mac cluster is still on top per CPU (Score:2)
    
    by BWJones ( 18351 ) writes: on Wednesday October 22, 2003 @04:08PM (#7283838) Homepage Journal
    
    Big Mac -> Rmax: 8164 Processors: 1936
    Cray X1 -> Rmax: 2932.9 Processors: 252
    
    I did say It still bests all other Intel hardware... Commodity clusters are entirely different beasts than dedicated supercomputers and this is exactly why I chose the terminology "clusters" rather than supercomputers. Also, check out the architecture of real "supercomputers". Most of the real costs are in CPU interconnectivity.
    
    Parent Share
    twitter facebook
- It's all about AMD (Score:2)
  
  by Alereon ( 660683 ) writes: on Wednesday October 22, 2003 @04:07PM (#7283828)
  
  Remember them? Manufacturer of the highest performance x86 processors available? An array of dual-Opteron systems could be built with dramatically lower price/performance ratio than any other platform, especially G5s or Intel Xeons.
  
  Parent Share
  twitter facebook
  - - - It's really fixed this time!! (Score:3, Informative)
        
        by StewedSquirrel ( 574170 ) writes: on Thursday October 23, 2003 @03:29AM (#7288435)
        
        The G5's memory controller is built into the U3 IC, which is essentially the "north bridge"- it is NOT built into the CPU.
        
        It connects to the CPU via the "Apple Processor Interface" NOT via hypertransport. It connects to it's memory controller at 1/2 the CPU speed, unlike Opteron and Athlon 64 which connect to the memory controller at FULL CPU SPEED.
        
        Documentation:
        developer.apple.com [apple.com]
        apple.com [apple.com] (thanks for the link)
        
        From the U3 Northbridge, G5 uses hypertransport to connect to the other peripherials at 3.2GB/s.
        Opteron supports a hypertransport rate of 6.4 GB/s [tomshardware.com] directly from the CPU.
        
        The Opteron 4xx and 8xx models also happen to have THREE of these hypertransport channels connected in a cross-bar configuration for SMP systems, giving EACH CPU a dedicated 6.4GB/s connection, rather than the G5 architecture which much share that connection (since there is only one U3 chip in a dually G5).
        
        Support for PCI-X in the G5 by standard is a great thing. I wish more AMD systems contained it... I appreciate their native support of firewire and gigabit eithernet. But seriously... do you really want to argue architecture against a workstation class CPU? I'm a bit dissapointed by the Athlon 64, but the Athlon 64 FX (desktop version of Opteron) and Opteron lives up to most of my expectations and I expect to see more speeds out in the near future.
        
        Stewey
        
        Parent Share
        twitter facebook
- - Re:The Mac cluster is still on top per CPU (Score:3, Informative)
    
    by BWJones ( 18351 ) writes: on Wednesday October 22, 2003 @08:48PM (#7286363) Homepage Journal
    
    Besides, performance per CPU doesn't matter much in these benchmarks, what matters is total bang for total buck, at the prices at which regular folks can get these machines (no special "we need a showcase" kind of deals). I suspect the 2.4GHz-based clusters are still a better deal than either the G5 or a 3.2GHz cluster, more CPUs or not.
    
    Actually, if you read back a little bit, you will find that the contract was awarded to Apple because they gave the best bang for the buck and it turns out that Dell optioned clusters would have been more expensive.
    
    Parent Share
    twitter facebook
Now at 8.2 Tflop as of today (Oct 22) (Score:5, Informative)

by daveschroeder ( 516195 ) * writes: on Wednesday October 22, 2003 @03:19PM (#7283366)

See http://www.netlib.org/benchmark/performance.pdf [netlib.org] page 53.

Since yesterday's release at 7.41 Tflop, the G5 cluster has already increased almost a Tflop, and is now ahead of the current #3 MCR Linux cluster, and about 0.5 Tflop behind a new Itanium 2 cluster.

Share
twitter facebook
- Re:Now at 8.2 Tflop as of today (Oct 22) (Score:2)
  
  by morcheeba ( 260908 ) writes: on Wednesday October 22, 2003 @03:31PM (#7283495) Journal
  
  So, at that rate, they should be #2 by Haloween!
  
  Parent Share
  twitter facebook
- Re:Now at 8.2 Tflop as of today (Oct 22) (Score:2)
  
  by mz001b ( 122709 ) writes: on Wednesday October 22, 2003 @04:32PM (#7284128)
  
  Since yesterday's release at 7.41 Tflop, the G5 cluster has already increased almost a Tflop, and is now ahead of the current #3 MCR Linux cluster, and about 0.5 Tflop behind a new Itanium 2 cluster.
  But the deadline for submissions to the Nov. 2003 Top 500 list was Oct. 1, so these improvements should not be counted in this list.
  
  Parent Share
  twitter facebook
Big Mac? How does that compare with a WOPR? (Score:5, Funny)

by Anonymous Coward writes: on Wednesday October 22, 2003 @03:19PM (#7283367)

/Watched WarGames too many times as a kid.

Share
twitter facebook
And mac fans are complaining? (Score:2)

by downix ( 84795 ) writes: on Wednesday October 22, 2003 @03:20PM (#7283380) Homepage

If someone used off-the-shelf machines that my company made, and got even into the top-10, you can bet your bottom dollar that the next thing in my job-pile would be a "make an announcement that we're in the top-10 fastest computers in the world."

This is fantastic, no matter what way you cut it! Using commodity components, these folk have turned the G5 into a real champion. No longer do budgets have to be in the hundreds, or even tens of millions to get a top-notch supercomputer. And this is not even the end, at the rate things are going, I would highly suspect that IBM is considering the G5 for one of it's own supercomputer projects, so hope is not lost yet. Imaging an IBM supercomputer, for under $1 million! Beat up your favorite chess champion and still afford the mansion in the Bahamas. 8)

Share
twitter facebook
- Re:And mac fans are complaining? (Score:2)
  
  by bhtooefr ( 649901 ) writes: <bhtooefr.bhtooefr@org> on Wednesday October 22, 2003 @03:50PM (#7283666) Homepage Journal
  
  Umm, they've got the POWER4, which is internally the same thing as the G5 (which they also make). WHY would they use the consumer-grade G5 (that Apple is demanding in mass quantities) when they can use the POWER4 that does the same thing and is server-grade (and IBM already uses)?
  
  Parent Share
  twitter facebook
  - Re:And mac fans are complaining? (Score:5, Interesting)
    
    by gerardrj ( 207690 ) * writes: on Wednesday October 22, 2003 @03:57PM (#7283725) Journal
    
    Because the Power4 is hotter and uses more current than the G5. To use 2200 Power 4 CPUs they would have to about triple the cooling capacity of the room. For all the heat and power, the Power4 lacks the AltiVec units that allow the G4/G5 to process vector operations so quickly.
    
    The G5 is also significantly lower cost than the Power4
    
    Parent Share
    twitter facebook
- - Re:What "commodity"? (Score:2)
    
    by dissy ( 172727 ) writes: on Wednesday October 22, 2003 @03:54PM (#7283702)
    
    > I'd like to see hot machines and clusters built out of something I could afford
    > to buy on a couple month's wages.
    
    Well, im sure we all do.
    I also want a house for what I can pay in two months wages.
    
    But these things do have costs.
    Even if each computer was $1 total, for 2000 of them thats $2000 right there.
    So even as much as $10 a computer would be 'affordable' thou definatly more than two months pay. But I have hope of actually saving up $20,000 after awhile.
    
    You find me $10 computers that can do 10 gflop, and we will be in business :)
    
    Parent Share
    twitter facebook
They didn't save the world AGAIN? (Score:4, Insightful)

by ianscot ( 591483 ) writes: on Wednesday October 22, 2003 @03:20PM (#7283385)

Yet another Apple product that failed to save the world. Lately they do nothing but disappoint us. Boo.
First you have the iTunes store which doesn't do anything but give the average user basically anything he or she might have wanted to have in on online music store. Despite its being free, we're all cheesed off that it doesn't support OGG, or it's meant partly to push iPods (duh), or whatever.
Now this -- a supercomputer that has, to quote that again, the "best price/performance ratio ever achieved on a supercomputer." But dang it all, it doesn't completely blow away every established precedent -- it's just in the top five on the usual list of comparisons. One more crushing disappointment.
From Microsoft, we just want products that don't completely ream us. From Apple, we want the entire world to seem a little friendlier and cooler with every product release, every dot-incremenent OS update. They both disappoint us, but the expectations seem a little different...

Share
twitter facebook
- Re:They didn't save the world AGAIN? (Score:2)
  
  by MasterVidBoi ( 267096 ) writes: on Wednesday October 22, 2003 @04:00PM (#7283744)
  
  I know this is really nitpicking, and is somewhat offtopic (but there isn't a front page iTunes thread at the moment), but it probably needs to be said.
  
  iTunes for Windows, just like Mac iTunes, does it's decoding using Quicktime. As crappy as you think Quicktime Player software is, the backend Quicktime library is very nice, especially in regards to it's modularity.
  
  Any app that uses Quicktime Lib can now play AAC files (even the iTMS 'protected' ones), not just iTunes. Of course, not may Windows apps use quicktime, but the ability is still there.
  
  Similarly, you can make Quicktime play OGG by installing a Quicktime OGG component (http://qtcomponents.sourceforge.net/). By extension, your shiny new Windows iTunes now plays OGG. Have fun :)
  
  Parent Share
  twitter facebook
What about the RAM? (Score:2)

by DAldredge ( 2353 ) writes: <SlashdotEmail@GMail.Com> on Wednesday October 22, 2003 @03:24PM (#7283431) Journal

Do these benchmark results take into account that software they have to run to check for memory errors?

Share
twitter facebook
I am sick and tired... (Score:3, Funny)

by DavidBrown ( 177261 ) writes: on Wednesday October 22, 2003 @03:28PM (#7283472) Journal

of all of these so-called "benchmark" discussions. Everyone really knows, in their heart of hearts, that the only valid benchmark is to be found in real-world applications such as Quake III. I want to know how many fps this alleged "supercomputer" gets.

Share
twitter facebook
Moore's Law applied (Score:3, Interesting)

by moof-hoof ( 678977 ) writes: on Wednesday October 22, 2003 @03:34PM (#7283517)

...it should still be noted that this is the best price/performance ratio ever achieved on a supercomputer.
Yes, but doesn't Moore's Law and the commodification of computer hardware suggest that each new generation supercomputer will have the best price/performance ratio?

Share
twitter facebook
- Re:Moore's Law applied (Score:2)
  
  by w42w42 ( 538630 ) writes: on Wednesday October 22, 2003 @04:06PM (#7283816)
  
  A very excellent point. I was also wondering how much time has passed between the time the Intel cluster and this Apple cluster were constructed. Would put things into a little more perspective regarding cost.
  
  Parent Share
  twitter facebook
Efficiency: switch topology? (Score:3, Insightful)

by mfago ( 514801 ) writes: on Wednesday October 22, 2003 @03:36PM (#7283541)

Efficiency is strongly dependent on the interconnect. Does anyone know if the 128 node benchmark (that supposedly showed ~80% efficiency) was run with only one Infiniband switch -- i.e. all nodes connected through only one switch?

BTW, the performance never was stated to be 17 TF, so it did not drop to 7.4 (or whatever it ends up to be).

Share
twitter facebook
- Re:Efficiency: switch topology? (Score:2)
  
  by op00to ( 219949 ) writes: on Wednesday October 22, 2003 @04:21PM (#7283998)
  
  #1: You're correct that no one stated it was 17 TF.
  #2: Sadly, you're still wrong. It was stated that it achieved "around 14 TFlops".
  
  Big Mac achieves around 14 TFlops with 128 Nodes [slashdot.org]
  
  Posted by CmdrTaco on 14:24 16 October 2003
  
  Parent Share
  twitter facebook
Price vs Preformance (Score:2, Interesting)

by Metex ( 302736 ) writes: on Wednesday October 22, 2003 @03:37PM (#7283546) Homepage

While I am amazed at the initial price vs preformance that this cluster of macs have obtained I am worried about the eventual cost all the electricity and cooling will be for the cluster. I remeber reading in some random article that the electricity used to cool and power the computer was extimated around 3,000 midrange homes. Just from a quick calculation of homes x $100 x 12 months we get the horrible figure of 3.6mil. So over a 10 year lifespan of the cluster it will cost 36mil more the the current price.

While it is still cheaper then the original cost of Intell or IBM super computers I personaly would rather spend more and waste alot less electricity, since if I remeber correctly the cost of engery for comparable super computers was in the range of 0.5 mil-1 mil. Although they are stationed in other countries so the cost of electricity could be dramaticly less in japan then in america but I doubt it. Someone should really get the kW per hour used by the top 5 super computers and then calculate the price per year based on that.

Share
twitter facebook
- Re:Price vs Preformance (Score:2)
  
  by bnenning ( 58349 ) writes: on Wednesday October 22, 2003 @04:03PM (#7283768)
  
  I remeber reading in some random article that the electricity used to cool and power the computer was extimated around 3,000 midrange homes.
  
  That can't possibly be right. There's no way that the cluster's power requirements are over 1 home's worth per CPU. Maybe they just added a zero and it's supposed to be 300, but even that sounds very high.
  
  Parent Share
  twitter facebook
- Re:Price vs Preformance (Score:2)
  
  by be-fan ( 61476 ) writes: on Wednesday October 22, 2003 @04:04PM (#7283796)
  
  That makes no sense. There are only 1100 nodes. That means that each node takes as much electricity as 3 mid-range homes?
  
  Parent Share
  twitter facebook
  - Re:Price vs Preformance (Score:2)
    
    by cyberassasin ( 4943 ) writes: <bmfrankNO@SPAMgmail.com> on Wednesday October 22, 2003 @04:19PM (#7283971) Homepage
    
    You forget the cost of cooling..... It is possible that the number is correct, but I don't think that it is very inefficient.
    
    Cooling takes lots of power, as you can see when the US has a hot summer and the grid and power plants struggle to keep up with demand.... The nodes do not consume that much power, relatively speaking
    
    Apple specs for the XServe dual processor max cinfiguration have maximum power consumption at 244W [apple.com]
    
    I doubt that the G5 dual processor is much more than that. I haven't seen power consumption data for the G5's yet.
    
    Parent Share
    twitter facebook
- Re:Price vs Preformance: Off an order of magnitude (Score:5, Interesting)
  
  by G4from128k ( 686170 ) writes: on Wednesday October 22, 2003 @04:23PM (#7284041)
  
  I think that magazine article must be wrong. If 1100 Macs use as much power as 3000 homes, then each mac is using about 3 houses worth of power. That seems excessive unless the home is in a 3rd world country or those 9 fans are really really running full blast. More likely, each G5 (with networking and cooling equipment) uses a few hundred watts. Even at 500 W/Mac, 1100 Macs, $0.15/kWH, 24 Hr/day, 365 day/year the cluster costs about $722,700/year. More likely, each Mac probably only consumes an average of 300 W max and is not running full tilt 24x7, so the cost is maybe around $300-$400k/year.
  
  But your point is a good one. I often wonder about the environmental economics of people running SETI, Folding@Home, etc. on older machines. Most of those older "spare" CPU-cycles are quite costly in terms of electricity relative to newer faster machines that do an order of magnitude more computing with the same amount of electricity.
  
  Parent Share
  twitter facebook
  - Re:Price vs Preformance: Off an order of magnitude (Score:3, Informative)
    
    by Helter ( 593482 ) writes: on Thursday October 23, 2003 @02:03AM (#7288164)
    
    You're forgetting the AC costs... If you've ever worked in a DC you know that the room itself can get mighty toasty, and toasty air leads to cooked systems.
    
    Each processor, drive, and switch generates heat which is dissipated into the air. Untouched that heat accumulates and will kill the entire thing. With 1100 dual processor nodes running (and you can be they'll each be running at pretty close to full tilt) constantly that's a hell of a lot of heat that needs to be removed from the air.
    
    Parent Share
    twitter facebook
Thats nothing (Score:4, Funny)

by madpierre ( 690297 ) writes: on Wednesday October 22, 2003 @03:40PM (#7283583) Homepage Journal

I installed a button on the front of my cluster
to manually clock the CPU's.

So far i've managed ONE whole flop.

My record is for the slowest supercomputer
on the planet.

Share
twitter facebook
- Re:Thats nothing (Score:3, Funny)
  
  by IM6100 ( 692796 ) writes: <elben@mentar.org> on Wednesday October 22, 2003 @04:10PM (#7283866)
  
  Build a computer that uses all CMOS static registers.
  
  Attach a hall-effect sensor to a hamster wheel to drive the clock.
  
  Go out and buy a hamster.
  
  Parent Share
  twitter facebook
  - Re:Thats nothing (Score:2, Funny)
    
    by madpierre ( 690297 ) writes: on Wednesday October 22, 2003 @04:22PM (#7284018) Homepage Journal
    
    Them Hamsters are pretty damn quick.
    Do they byte ?
    
    Parent Share
    twitter facebook
Missing the point (Score:2)

by Uttles ( 324447 ) writes: <uttles@g m a i l.com> on Wednesday October 22, 2003 @04:04PM (#7283793) Homepage Journal

If this cluster was MAC and anywhere near the size/cost of other clusters it would easily be number 1, assuming of course they do workout those efficiency problems.

Share
twitter facebook
8 TFlops on a single board anyone? (Score:2)

by Opiuman ( 172825 ) writes: <redbeard.gmail@com> on Wednesday October 22, 2003 @04:04PM (#7283795) Homepage

Check out the Englight256 [lenslet.com]... Coming soon to a military installation near you...

Share
twitter facebook
PPC64 optimizations? (Score:2)

by macdaddy ( 38372 ) writes: on Wednesday October 22, 2003 @04:09PM (#7283844) Homepage Journal

Does anyone know if Linpack was optimized for PPC hardware, specifically the 64-bit G5 with all its bells and whistles? That makes quite a difference.

Share
twitter facebook
Price/performance and Moore's Law (Score:2)

by Mignon ( 34109 ) writes: <satan@programmer.net> on Wednesday October 22, 2003 @04:10PM (#7283872)

... it should still be noted that this is the best price/performance ratio ever achieved on a supercomputer.
Noted. And go VT, go Apple! Now, with the cheerleading out of the way, I wonder something - with Moore's law and all still applying pretty well, just getting the latest-and-greatest any home computer architecture will all but guarantee you pretty good price/performance.
As another poster pointed out, someone's recent laptop could do as well on Linpack as a 1992 supercomputer.
So what I think would be interesting would be a kind of adjustment for Moore's law, sort of like how prices are adjusted for inflation when comparing, say, the cost of building the Empire State Building with the cost of building the World Trade Center.
Any economists out there with any good ideas?

Share
twitter facebook
Processor architecture and application performance (Score:2)

by blanks ( 108019 ) writes: on Wednesday October 22, 2003 @04:12PM (#7283889) Homepage Journal

Interesting link describing the processor architecture and application performance in modern supercomputers.

Good read for anyone interested in some of the background in current super computers and what they used for testing.
Heres the link. [jukasisters.com]

Share
twitter facebook
Scalability (Score:5, Informative)

by jd ( 1658 ) writes: <imipak@ya[ ].com ['hoo' in gap]> on Wednesday October 22, 2003 @04:16PM (#7283935) Homepage Journal

First, scalability is highly non-linear. See Amdahl's Law. Thus, the loss of performance is nothing remarkable, in and of itself.

The degree of loss is interesting, and suggests that their algorithm for distributing work needs tightening up on the high-end. Nonetheless, none of these are bad figures. When this story first broke, you'll recall the quote from the top500 list maintainer who pointed out that very few machines had high performance ratings, when they got into the large numbers of nodes.

I'd say these are extremely credible results, well worth the project team congratulating themselves. If the team could open-source the distribution algorithms, it would be interesting to take a look. I'm sure plenty of Mosix and BProc fans would love to know how to ramp the scaling up.

(The problem of scaling is why jokes about making a Beowulf cluster of these would be just dumb. At the rate at which performance is lost, two Big Macs linked in a cluster would run slower than a single Big Mac. A large cluster would run slower than any of the nodes within it. Such is the Curse that Amdahl inflicted upon the superscaler world.)

The problem of producing superscalar architectures is non-trivial. It's also NP-complete, which means there isn't a single solution which will fit all situations, or even a way to trivially derive a solution for any given situation. You've got to make an educated guess, see what happens, and then make a better informed educated guess. Repeat until bored, funding is cut, the world ends, or you reach a result you like.

This is why it's so valuable to know how this team managed such a good performance in their first test. Knowing how to build high-performing clusters is extremely valuable. I think it not unreasonable to say that 99% of the money in supercomputing goes into researching how to squeeze a bit more speed out of reconfiguring. It's cheaper to do a bit of rewiring than to build a complete machine, so it's a lot more attractive.

On the flip-side, if superscaling ever becomes something mere mortals can actively make use of, understand, and refine, we can expect to see vastly superior - and cheaper - SMP technology, vastly more powerful PCs, and a continuation of the erosion of the differences between micros, minis, mainframes and supercomputers.

It will also make packing the car easier. (* This is actually a related NP-complete problem. If you can "solve" one, you can solve the other.)

Share
twitter facebook
point missed (Score:2, Insightful)

by gerardrj ( 207690 ) * writes: on Wednesday October 22, 2003 @04:16PM (#7283936) Journal

Most responses in here are about how the G5 should be performing better, or should have better numbers than the Xenon or Sparc, or whatever.
What seems to be missing from most of the conversation is that it's not the Mac's that are loosing efficiency per se, it's the network (the interconnects) that is slowing the machine as a whole down. I know little about the LinPac test, but I would assume that it's written to test/stress the entire machine: CPU, disk, memory and interconnects. If the Macs can finish parts of a problem really fast, but can't get new data in to the nodes fast enough, that will casue a tremendous loss in effieciency.
Perhaps they need a mechanism for buffering new data on the nodes so that incoming and outgoing data can stream as the network is available and keep the CPUs working all the time.

Share
twitter facebook
32 bit numbers? (Score:2)

by Barbarian ( 9467 ) writes: on Wednesday October 22, 2003 @04:20PM (#7283981)

So, in all these "maximum speed tests", what is being used, 32 bit reals or 64 bit reals? The difference is that in solving large non-linear systems, the higher precision numbers result in a faster solution, but operations involving doubles will resulting a lower gflops measurement with benchmarks (although a solution may in fact take 10x less iterations).

Share
twitter facebook
- Re:32 bit numbers? (Score:2)
  
  by cweber ( 34166 ) writes: <[cwebersd] [at] [gmail.com]> on Wednesday October 22, 2003 @04:39PM (#7284197)
  
  The Linpack benchmark uses floats, not reals. Double precision floats, to be exact.
  
  Parent Share
  twitter facebook
Congrats to the VT team (Score:2)

by psyconaut ( 228947 ) writes: on Wednesday October 22, 2003 @04:42PM (#7284229)

(Disclaimer: I'm a Mac user).

I still think #4 in the world is pretty damn impressive for Apple hardware! And it looks like there might be some small performance improvements to come.

I think everyone involved did a pretty damn good job! Have a beer on me.

-psy

Share
twitter facebook
This also makes the Big Mac... (Score:2, Funny)

by Anonymous Coward writes: on Wednesday October 22, 2003 @04:49PM (#7284289)

the first supercomputer to feature exactly 1 mouse-button.

(hides/ducks - I ain't an anonymous coward for nothing!)

Share
twitter facebook
seti@home not listed (Score:5, Interesting)

by suitti ( 447395 ) writes: on Wednesday October 22, 2003 @05:09PM (#7284530) Homepage

The 21st version of this list does not
show the SETI@Home project. The top entry
is NEC at 35 terraflops. Today's SETI@Home
average for the last 24 hours is 61 terraflops.
It may be a virtual supercomputer, but it
is producing real results.

Share
twitter facebook
- Re:Frys... (Score:2)
  
  by gl4ss ( 559668 ) writes: on Wednesday October 22, 2003 @03:25PM (#7283438) Homepage Journal
  
  just pour some coke(liquid kind of) inside few of them.
  
  i'd think it would fry then.
  
  Parent Share
  twitter facebook
- Re:Some tweaking will do it good... (Score:2, Interesting)
  
  by defl8ed ( 690375 ) writes: on Wednesday October 22, 2003 @03:51PM (#7283670) Journal
  
  Yes, the G5 should be capable of more than a little better performance than "a Xeon", but what I find interesting is that it is a Xeon which was initially released well over a year ago by Intel. What I am curious about is if someone could build an equally "cost-efficient" super computer based on more recent intel hardware. The differences in speed, cache, front side bus, etc. that Intel has made in the past year would no doubt lead to higher numbers. If I were comparing a Xeon Cluster to a G4 cluster, people would scream that it's apples and oranges - why does the same not hold true for intel CPUs?
  
  Parent Share
  twitter facebook
- - Re:Pentium 4.... non xeon? (Score:2)
    
    by ratfynk ( 456467 ) writes: on Wednesday October 22, 2003 @04:10PM (#7283861) Journal
    
    "2543 process cluster of Xeon 2.4's"
    Problem: MS Software, or optimising NIX or Linux for clustered smp. Could be the equivalent of a Clusters last stand, up against Apples injun' mod bsd cluster software.
    
    Parent Share
    twitter facebook
- Re:Unfair discounted price/performance (Score:2)
  
  by Space Coyote ( 413320 ) writes: on Wednesday October 22, 2003 @04:31PM (#7284113) Homepage
  
  That's pretty much the standard Apple education discount. Put away the tinfoil hat.
  
  Parent Share
  twitter facebook
- - Re:facts, please? (Score:3, Informative)
    
    by penguin7of9 ( 697383 ) writes: on Wednesday October 22, 2003 @06:41PM (#7285355)
    
    Thanks for the pointer. Now, about that "most cost effective" bit? Compared to what? At retail prices?
    
    Parent Share
    twitter facebook

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

A supercomputer by Any Other Name.... (Score:5, Interesting)

My computer is SUPER!!! (Score:2, Funny)

Re:A supercomputer by Any Other Name.... (Score:3, Insightful)

Re:A supercomputer by Any Other Name.... (Score:3, Funny)

Re:A supercomputer by Any Other Name.... (Score:4, Funny)

From the horse's mouth (Score:5, Interesting)

snazzy new G5 logo too! (Score:4, Funny)

Re:snazzy new G5 logo too! (Score:2, Funny)

Important items of note (Score:5, Informative)

Re:Important items of note (Score:2, Interesting)

Re:Important items of note (Score:2, Interesting)

Actually, it's already at 8.2 Tflop today (Oct 22) (Score:3, Informative)

Re:Important items of note (Score:5, Informative)

Re:Important items of note (Score:5, Insightful)

Not really (Score:5, Informative)

Re:Important items of note (Score:3, Informative)

Also Important? (Score:3, Informative)

Re:Important items of note (Score:2)

Re:Important items of note (Score:2)

Re:Important items of note (Score:2)

AltiVec won't help here (Score:5, Informative)

the REAL reason to build a top-5 supercomputer (Score:5, Funny)

Re:the REAL reason to build a top-5 supercomputer (Score:2)

Too good to be true... (Score:5, Insightful)

PARENT IS NOT A TROLL (Score:2)

new updated troll (Score:3, Funny)

Big mac cluster.. (Score:5, Funny)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:3, Funny)

Re:Big mac cluster.. (Score:5, Funny)

Re:Big mac cluster.. (Score:4, Informative)

Re:Big mac cluster.. (Score:3, Informative)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:5, Informative)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:2, Funny)

Re:Big mac cluster.. (Score:2)

Re:Big mac cluster.. (Score:2)

Instant Numbers... (Score:3, Insightful)

Does anyone else have trouble reconciling... (Score:5, Funny)

Re:Does anyone else have trouble reconciling... (Score:3, Insightful)

It's a good price/performance, but not best. (Score:3, Interesting)

Catch Phrase (Score:4, Funny)

This is NOT all that surprising. (Score:5, Insightful)

Re:This is NOT all that surprising. (Score:2, Informative)

Re:This is NOT all that surprising. (Score:2)

Re:This is NOT all that surprising. (Score:5, Informative)

Re:This is NOT all that surprising. (Score:2)

Re:I/O bandwidth and latency (Score:3, Informative)

Re:I/O bandwidth and latency (Score:3, Informative)

The Mac cluster is still on top per CPU (Score:5, Interesting)

Re:The Mac cluster is still on top per CPU (Score:2)

Re:The Mac cluster is still on top per CPU (Score:2)

It's all about AMD (Score:2)

It's really fixed this time!! (Score:3, Informative)

Re:The Mac cluster is still on top per CPU (Score:3, Informative)

Now at 8.2 Tflop as of today (Oct 22) (Score:5, Informative)

Re:Now at 8.2 Tflop as of today (Oct 22) (Score:2)

Re:Now at 8.2 Tflop as of today (Oct 22) (Score:2)

Big Mac? How does that compare with a WOPR? (Score:5, Funny)

And mac fans are complaining? (Score:2)

Re:And mac fans are complaining? (Score:2)

Re:And mac fans are complaining? (Score:5, Interesting)

Re:What "commodity"? (Score:2)

They didn't save the world AGAIN? (Score:4, Insightful)

Re:They didn't save the world AGAIN? (Score:2)

What about the RAM? (Score:2)

I am sick and tired... (Score:3, Funny)

Moore's Law applied (Score:3, Interesting)

Re:Moore's Law applied (Score:2)

Efficiency: switch topology? (Score:3, Insightful)

Re:Efficiency: switch topology? (Score:2)

Price vs Preformance (Score:2, Interesting)

Re:Price vs Preformance (Score:2)

Re:Price vs Preformance (Score:2)

Re:Price vs Preformance (Score:2)

Re:Price vs Preformance: Off an order of magnitude (Score:5, Interesting)

Re:Price vs Preformance: Off an order of magnitude (Score:3, Informative)