Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Drooling Over VA Tech's 1100-Node G5 Cluster

Posted by pudge on Sat Sep 20, 2003 11:46 PM
from the more-pics-wanted dept.
Mr. Slurpee writes "Virginia Tech's 1100-node dual 2 GHz Apple G5 Terascale Cluster is getting racked up and ready to roar. If you're a penniless geek like me, at least there's some tech pr0n for us to drool over. There's 1100 of them ... think they could part with one?" Update: 09/22 02:55 GMT by T : Matt submits a link to this full mirror of the photos, writing "The page owner's comment on the original mirror being taken down due to bandwidth? 'Bring it on!'"
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by Anonymous Coward on Saturday September 20 2003, @11:48PM (#7015595)
    Imagine a beowulf clu...oh, wait.
  • by Anonymous Coward on Saturday September 20 2003, @11:50PM (#7015597)
    here [cmu.edu].
  • by jtnishi (610495) on Saturday September 20 2003, @11:50PM (#7015598) Homepage
    Oh God.

    Imagining each one of those came with just a little bit of Steve Job's Reality Distortion Field, someone from NASA might want to head over there and make sure that some kind of tear in space/time doesn't occur right there. With that many G5s, we don't know what level of destruction could happen.
  • space.. (Score:4, Insightful)

    by kidlinux (2550) <duke.spacebox@net> on Saturday September 20 2003, @11:52PM (#7015610) Homepage
    Using full sized cases seems like a rather inefficient use of space to me. But I guess those cases are all fairly full - the heatsinks in those things are enormous. Wish PCs had heatsinks like that, then maybe mine wouldn't be so noisy.
    • Re:space.. (Score:4, Funny)

      by entartete (659190) on Saturday September 20 2003, @11:59PM (#7015655)
      the full sized cases will provide greater resistance to all the spoooge that will be sprayed over them by spontaneous orgasms of the hordes of apple fans comeing to worship before the mother of all apples.
      • Re:space.. (Score:4, Funny)

        by SoupIsGoodFood_42 (521389) on Sunday September 21 2003, @02:22AM (#7016207) Homepage
        the full sized cases will provide greater resistance to all the spoooge...

        Yeah, except for the fact that the front is covered in a mesh of holes, with fans sucking air through them. Maybe that's why you can see clear plastic hanging down in some of the photos.

      • I love it (Score:4, Funny)

        by FredFnord (635797) on Sunday September 21 2003, @02:41AM (#7016281)
        Funniest thing I've seen all day.

        And I'm a Mac guy, too. I wouldn't mind wandering through that room for a while myself... though I probably would keep my pants on.

        -fred
  • by Xpilot (117961) on Saturday September 20 2003, @11:52PM (#7015614) Homepage
    This presentation contains content that your browser may not be able to show properly. This presentation was optimized for more recent versions of Microsoft Internet Explorer.

    Why make a website "optimized for IE", when the content of the said website is of interest to people who are probably not running IE or Windows?
    • The simply created a PowerPoint presentation and converted it to "HTML". Actually, it doesn't even appear to *try* to be HTML - I have no idea what this crap is. Through their laziness, those morons managed to make their very simple page inacessible to any browser but Internet Explorer (well, neither Opera 7 nor Moz 1.4 render the page, from what I can see, maybe it works in KHTML-based browsers, but I doubt it). Oh well, Search Engines won't be able to index the content, and that'll be their loss

        • One of the reasons VTech went for a G5 based cluster WAS price-performance...The mac option is cheaper then a PC aption and easier to install and maintain then Linux says the slide show. I might not fully agree, but thats their reasoning.
  • How long... (Score:3, Insightful)

    by FooGoo (98336) on Saturday September 20 2003, @11:52PM (#7015618)
    do you think it would take this nifty cluster to correct the barrel distorition from their wide angle lesnse?

    FooGoo
  • Where is mine? (Score:4, Insightful)

    by TiMac (621390) on Saturday September 20 2003, @11:53PM (#7015620)
    So now the question is...

    They've got 1100+...where's mine? I ordered a Dual 2.0 GHz G5 in July....still no sight. Supposed to ship on Tuesday....but online time will tell....

    Sigh...Maybe they'll loan me one if mine gets delayed!

    PS--anyone got the rest of these pics? There were a TON of them...Mirror? COMPLETE?

  • Hm. (Score:5, Interesting)

    by ixt (463433) on Saturday September 20 2003, @11:55PM (#7015633)
    What I really would like to know is how they install and configure all those machines. Their method of doing that will be very useful for even the (relatively) smaller networks that don't necessarily have to be clusters.

    For example, I've yet to figure out a way to effectively get a computer lab with 30 eMacs installed and configured the same way. DHCP/Netboot is slow because we only have 100mbit switches. Split CD images are slow, and Jaguar doesn't yet have free software that does that yet (besides the dd of course). I'm not sure how to keep them all updated either.

    I really hope they describe how they maintain the operating system on them.
    • NetBoot is slow? (Score:5, Insightful)

      by 2nd Post! (213333) <`gundbear' `at' `pacbell.net'> on Sunday September 21 2003, @12:06AM (#7015681) Homepage
      I haven't actually tried it yet since I don't have access to enough Macs, but I imagine it's something you would start and let happen overnight... I mean, that's more or less how Apple does it in their own stores, wipe and restore overnight, I think. Or at least after the store closes and before the next opening day.
    • Re:Hm. (Score:4, Informative)

      by tecnobabble (611104) on Sunday September 21 2003, @12:45AM (#7015830)
      Actually, I just ran into this problem with a new eMac lab at my school, we set it up in about 4 hours. :)

      Easy way to do it.

      1. Set up 1 machine how you want it.
      2. Get a bunch of firewire cables.
      3. Hook the eMac's together using the cables. (If you can't reach with the cables, get some portable firewire drives, iPods work well with this too.)
      4. Use Carbon Copy Cloner 2.2 (http://www.bombich.com/software/ccc.html) and move down the line of machines until they're all the same.
      5. Go in and change HD, Network, etc names.
      6. Smile because you just did something in 4-5 hours that it would take Windows users a week to do.

      If you have questions, feel free to email (sethmath @ mac.com) me about it. I can walk you through if necessary.
      • Re:Hm. (Score:3, Informative)

        by gerardrj (207690) *
        I'm as big a Mac fan as anyone, but this would not take a Windows user a week. There are several apps that will mirror and restore HDs in a matter of minutes. Over a 10bT network I used to use Ghost to generate over 25 Win95 installs an hour, just by myself.
        1. Boot to floppy
        2. Press menu option for image to install
        3. Boot machine
        4. Change HD, Network, etc names

        I don't know what the average user/site would encounter with the WinXP authorization, but I know larger sites get blanket installation without the c
        • Re:Hm. (Score:4, Informative)

          by chrome (3506) <chrome@@@stupendous...net> on Sunday September 21 2003, @04:58AM (#7016568) Homepage Journal
          Exactly.

          Except, with Ghost, you could install 1000 machines in 30 minutes - using multicast.

          A couple might fail and you'd have to redo them, but if you have a 100Mbit switched network (or gig, even better) then its about 30 minutes to blast a Windows 2000 install to any number of machines.

          I love macs, typing this on a PB17", but all the apple zealots out there really make me ashamed sometimes.

          Macs are strong in some areas, and weak in others. If it wins in something, DONT RUB PEOPLES FACES IN IT. They don't care.

          Get over it.
      • Re:Hm. (Score:5, Informative)

        by Benley (102665) on Sunday September 21 2003, @01:37AM (#7016037) Homepage Journal

        I run a lab with about 50 macs (assorted models, from 350mhz iMacs through 800mhz eMacs, and a few 1ghz G4's) - I spent a good bit of time on a solution, and it's really not as hard as this thread makes it sound.

        First, I build one system and set it up *Exactly* the way I want all the others to be. I have some run-once script voodoo to set the IP address of each machine based on its Mac address, and to munge some ByHost user preferences for the built-in guest account. Then, I use Carbon Copy Cloner">Carbon Copy Cloner [bombich.com] to create an image of that machine's hard drive.

        Once I have an image of the machine, I use NetRestore [bombich.com]NetRestore (by the same guy as CCC) to create a netboot image that will automatically install the master machine's HD image onto each client.

        I am fortunate to have a MacOS X Server machine on which to run the NetBoot server - which is independent of the subnet's master DHCP server, I might add - but it is possible to netboot macs from other Unix machines with a bit of patching to dhcpd.

        Anyhow, all in all I don't find it any more difficult to netinstall Macs than it is to do the same for Windows machines. Building the master clone image is time consuming and annoying, but it always will be for any platform.

        Feel free to email me if you are interested in my machine setup voodoo script. I had to borrow some binaries from OS X Server in order to make it work. It's slowly turning into something useful as I add more functionality to it.

  • by idontgno (624372) on Saturday September 20 2003, @11:56PM (#7015637) Journal
    I was doing some window shopping in a large entertainment electronics/computers store tonight, and started playing with a G5 uniprocessor. The salesdrone drifted over and started his sales rap, and I busted in to ask about the dual-processor G5s.

    He had to admit they didn't have any in stock, and weren't expecting to get any from Apple for some time.

    I guess I know where the dual-G5 systems are all going. Ah, well, it's all for a good cause. I hope.

    • by entartete (659190) on Sunday September 21 2003, @12:05AM (#7015672)
      we got the dual processor g5 we ordered in at the university I work at and have had it for a while. I guess apple is really pushing to make the institutional customers happy. and if a university gets a g5 dozens of students can play with it and drool over it and become filled with g5 lust while if one regular customer gets one they'll just hide in their room mumbling about 'my preciousssss' and fondling it and that's not very good advertising and is sorta creepy.
  • Cocktease (Score:4, Interesting)

    by mrpuffypants (444598) * <mrpuffypants&gmail,com> on Sunday September 21 2003, @12:02AM (#7015663)
    It's a bit of a cocktease to post this link right now...Most of the mac community sites linked to the pictures at Virginia Tech's site but brought it down. Try clicking on the "pictures" link on their site and you'll se that they chmod 0'd the whole site so that the bandwidth usage won't peak out again

    The pics at chaosmint are a small selection of what was originally on the site.

    But to be on topic I'm suprised that Apple didn't get them Xserve G5's for the cluster. While the desktop G5's look cool it's really unneeded to use up all that space.
    • Re:Cocktease (Score:3, Insightful)

      by Maktoo (16901)
      That's because there is no such thing from Apple yet. Given the serious engineering that obviously went into the G5 case, I don't think we can assume that they can just drop a G5 into a current XServe and sell it.

      Plus, these guys were on a pretty strict deadline, the cluster has to be functional by November IIRC. So, they wouldn't have wanted to wait any longer than they did.
  • by F2F (11474) on Sunday September 21 2003, @12:06AM (#7015679)
    Pink [lanl.gov] at LANL has the following:

    1024 nodes
    2048 cpus
    1024 power cables
    1024 Myrinet network cards
    2048 fiber cables (8.8 miles)
    3072 Myrinet switch ports
    4096 sticks of RAM (2 Terabytes)
    7168 fans
    1 hard drive
    1 CDROM drive

    Not only do they have pictures of its assembly, they have movies [lanl.gov].

    Check the web page for more stats and better quality movies.

    Oh, yes, it's unclassified :)
  • by stevens (84346) on Sunday September 21 2003, @12:18AM (#7015730) Homepage
    I hope they were able to run these without video cards. I can't imagine 1100 brand-new sweet ATI video cards sitting idle for years...
  • Ummmm.... (Score:3, Funny)

    by Y-Crate (540566) on Sunday September 21 2003, @12:21AM (#7015743)
    As a Professional * Information Technology Location Analyst and Physical Security Specialist I need to use my professional abilities to make a professional analysis of the situation for my professional collegues so that we may put forth a professional solution to this problem.**

    * - I really, really hate people who make gratutious use of the word "professional" as some sort of elitist mark of supremecy

    ** - I would like to run in there, see if the machines are locked down, and grab as many as I can hold.

    (And yes, I'm just joking, I don't want to steal anything from them and I neither have the plans nor the means to do so, it's just a joke people)
  • my first tech pr0n (Score:5, Interesting)

    by morcheeba (260908) on Sunday September 21 2003, @12:38AM (#7015804) Journal
    Ah, vivid memories of the cover of Softtalk magazine, with a picture of the Apple II assembly line with hundreds of machines. Just imagine... 200 * 64k = 12.5 MEGABYTES! That would take 90 floppies to store all that data!

    Now some statistic pr0n:
    There were about 5 1/2 million Apple IIs sold [interesting-people.org], so at an average of 64k each (just a guess), that would be 343 GB of memory total. Adding up the couple of computers in the office (it's a 4 person company), we're about 1/70 of the way there. Assuming 2 140K floppy drives per computer, that would be 1.5TB of disk storage -- that would be 6 hard drives, and they would occupy less space than a single pair of old floppy drives.
  • by Kirby-meister (574952) on Sunday September 21 2003, @01:20AM (#7015956)
    I volunteered time to help get some G5's ready for this baby, and I think my favorite moment was getting the tshirt all volunteers received:

    It is quite the fashion statement [vt.edu] :)

    (Excuse the blurriness and poor lighting - crappy cam and crappy dorm lighting)

  • by FredFnord (635797) on Sunday September 21 2003, @03:19AM (#7016383)
    Or do you just want to bitch?

    The real answer is that the problems that are going to be solved with this cluster are easily parallelizable. That's the IDEA, right? 1100 machines, each running one chunk. Well, the G5, and more specifically the Altivec vector processing section of it, is SO MUCH better for processing big bites of easily parallelizable data at a time than any of the alternatives that it can run rings around any Intel or AMD machine you care to name with fewer than double the number of processors. (And in the cases of some particular kinds of calculations, it beats those, too. But you can't count on that for all your problems.)

    We've seen this before a number of times... I seem to recall a gene sequencing program that was running five or six times faster on a G4 than it was on a Pentium IV of the same speed. And then there's SETI@home, which runs much faster, cycle-for-cycle, on the Mac, and doesn't even USE altivec. (Though I believe it does take advantage of the 'multiply-and-add' instruction of the PPC, which is another nice little feature.)

    Altivec is an astonishingly clean and usable interface for an amazingly powerful vector processor that is, in 99% of the Macs out there, underutilized to the point that if it suddenly disappeared, most people wouldn't notice any difference at all. It's kind of a pity, really.

    Basically, Intel came out with MMX (and all the later developments) in order to have a talking point on a slide presentation about their processors, about the time when competitors like AMD were starting to come forward: functionally, an awful mess, and impossibly difficult to program. (In fact, for the first few years, Intel would send programmers out to work with companies to implement MMX, because otherwise none of them would bother.

    AMD came up with something that was a little less hacked together in a very short period of time, as a response to Intel. But it still wasn't pretty, at least partially because of the limitations of the archetecture, and the performance wasn't *that* much better than just doing without.

    Apple (who really designed a lot of the basics themselves when it comes to Altivec, so don't think this was a Motorola invention) said, 'Hey, wow, we need something like that, in order to compete.' First they decided on a coprocessor, but that didn't fly any better with the PPC than it did with the older Macs (840av, 660av) with DSPs in them. So they sat down and came up with a really *good* spec for a set of multimedia extensions. And they've only gotten better since.

    I've toyed with altivec code, and I can tell you that in one application that I wrote, one instruction (vector permute) did the work of ten or more non-altiveced instructions on four times the data per cycle. Mind you, I just did it for fun, I don't know enough about parallel computing problems to come up with anything useful... but there's some interesting stuff under the hood.

    Of course, nobody is going to believe this, because as fashionable as it is to like MacOS X on slashdot these days, nobody wants to admit that, for *some* subset of problems, Mr. Jobs's reality distortion field might not be quite as much of a distortion as you might think...

    -fred
  • by Decaffeinated Jedi (648571) on Sunday September 21 2003, @07:23AM (#7016895) Homepage Journal
    Between the on-campus nuclear reactor and the supercomputer cluster, I'd keep an eye out if I were Tech's cross-state rival, University of Virginia. I'd say the Hokies are just one diabolical dean away from becoming an evil university bent on world domination. And five bucks says they start in Charlottesville.
    • Re:Why G5s? (Score:5, Informative)

      by 2nd Post! (213333) <`gundbear' `at' `pacbell.net'> on Saturday September 20 2003, @11:57PM (#7015643) Homepage
      Here's [chaosmint.com] why. Some of the more pertinent points:

      Dell - too expensive [one of the reasons for the project being so "hush hush" was that dell was exploring pricing options during bidding]

      Sun (sparc) - required too many processors, also too expensive

      IBM/AMD (opteron) - required twice the number of processors and was twice the price in the desired configuration; had no chassis available

      HP (itanium) - ditto

      Apple (IBM PPC970) - system available with chassis for lowest price
    • Re:Why G5s? (Score:4, Funny)

      by commodoresloat (172735) on Sunday September 21 2003, @05:51AM (#7016716)
      Yeah man I don't understand the big deal with G5s for this kind of application. I'm sitting here in front of my 1100-unit dual-G5 cluster at my freelance gig trying to copy a 17M file from one folder to another and it's taking over 20 nanoseconds. My Cray at home would be done with this already, and even my beowulf cluster of TRS-80s wouldn't take this long....
    • If you read the link [chaosmint.com] in one of the earlier comments, you would see that:
      Slide Four

      Choosing the Right Architechture

      cost vs. performance (purely)
      total cost $5.2 million includes system itself, memory, storage, and communication fabrics
      one of the cheapest systems of its kind

      Slide Five
      Architectural Options

      Dell - too expensive [one of the reasons for the project being so "hush hush" was that dell was exploring pricing options during bidding]
      Sun (sparc) - required too many processors, also too expensive
      IBM/AMD (opteron) - required twice the number of processors and was twice the price in the desired configuration; had no chassis available
      HP (itanium) - ditto
      Apple (IBM PPC970) - system available with chassis for lowest price
      • by WasterDave (20047) <davep&zedkep,com> on Sunday September 21 2003, @12:58AM (#7015876)
        IBM/AMD (opteron) - required twice the number of processors and was twice the price in the desired configuration; had no chassis available

        Y'know, I saw this presentation a few days ago. I wasn't there, I saw it on the net. Anyway, this bullet point stuck out then - like, what are they talking about?

        For one, how come it required twice the number of processors? From the benchmarks I've seen Opterons normally whup the G5, or are at least very competitive on paticularly G5 optimised code. Certainly not out by a factor of two, anyway.

        And no chassis? What the hell does this mean? You can get 1U, 2U and 4U beast Opteron boxes from the likes of, well, IBM for one. As mentioned above.

        It's not even like the kinda ropey nature of 64 bit Linux comes into play either because, well, there is no 64 bit OS X - unless VT know something we don't (which is always possible).

        So, yeah, I think someone decided to buy all the G5's made for a month and just set up the project to make it happen. This "achitectural options" thing is horseshit.

        Dave
        • Re: Processors
          Perhaps for their benchmarks, the G5 was 2x the performance of the Opteron. Have you taken into consideration the Altivec processor, which happens to be 128bit in size? Any vector processing will be enhanced greatly by the powerful nature of the G5 in general, and especially when using Altivec optimized code. Couple this with IBM's XLC auto-vectorizing C compiler, and I wouldn't be surprised if Altivec did wipe SSE2/3D!Now; it's been discussed before that Altivec is a superior solution to MMX/MMX2/SSE, and SSE2, so there's no reason to doubt that when you pump up the FSB from 167MHz->1GHz, pump up the CPU from 1.4GHz->2.0GHz, on the PowerPC architecture, that Altivec doesn't become the most powerful SIMD solution in commodity computing.

          Re: Chassis
          It may be a time of research vs time to market discrepancy; IE, at the time VT was requesting bids, there were no Opteron chassis announced or available, whilst Apple may have had at 95% completion, barring an actual press release and announcement. Like, simultaneous to the release of the G5 there are no IBM PPC 970 machines, yet both companies use the same CPU.

          Re: OS X
          Yeah, there is a 64 bit X. It's called OS X Panther, and there's a 64 bit aware X called 10.2.7, and the libraries for Altivec have been 128bit for years now, so all 10.2.7 really added was... 64 bit pointers and memory addresses, really.

          To recap: Altivec makes a big difference. Having immediately available machines makes a difference. Having a lower price point per performance per machine makes a difference (each node, including AC + networking + ram only costs about $4,727, which is $1,600 lower than an identically specced stock dual G5 with 4GB of ram!), as well as supportability of OS X vs Linux or, heaven forbid, Windows 2k... And yes, OS X for these machines are at least 64 bit enough to address 8GB of ram, and the OS has *always* been able to manipulate 128 bit data, as well as 64 bit data.
        • by gerardrj (207690) * on Sunday September 21 2003, @01:10AM (#7015916) Journal
          - It's pretty clear to me that Apple didn't divert anything. If you look at the numbers, the VT order accounts for about 1% of all Dual G5 orders. That's hardly enough to cause the delays that people are seeing in their ship dates. Notice the slide states Apple offered an "early september" ship date, but Apple initially promised customers a mid-late August date. Given when those talks between VT and Apple were likely taking place, that means that Apple had intended to fill other orders first, and had a special allotment for the VT order.

          - I don't know a whole lot about a blade center, but there doesn't seem to be a place to plug in the high-speed interconnects. Also, it runs on Intel chips that run hotter and do less work than the G5, especially when AltiVec gets involved, which is usually why you build a computer this size; vector processing. I'm also guessing the required configuration needed resale value to students at the end of life for the project/system.

          - That's absoloutly true. When you need technical details about Linux you have to dig. When you have a question about OS X's guts, I'd guess you call Apple and have a conference call with all the coders (at least at this level of purchase/prestige). Could you imagine trying to get Linus, and all the other code writers for Linux and the supporting libraries and utilites on the phone at once?
    • Reliability.
      No one in their right mind would try to argue that one couldn't build a home-grown system for less. But with optical ports? FW 400 and 800? Gigabit ethernet? USB 2.0? And with said home-grown machines, when the NIC goes bad in one, or a memory slot goes bad in another, who do you call? The NIC or mainboard manufacturer? So you what, keep a list of all your machines, give 'em i.d. numbers or whatever, itemize the guts and who made what (mainboard, NIC, RAM, CPU, HDD, etc.) of each, and hope to make sense of it all when stuff starts to fail? Me, if I was in charge of it, it would make sense to me to farm it all out to one company, and then when something breaks there is one number that I have to call.
      Also, lets not forget that this is probably going to be used for research, and if it involves vectors, then AltiVec is the SIMD for you.

      Of course, being human, my opinion is suspect.

      (tig)
    • It doesn't take pull... it takes money. Small schools have other significant advantages.

      When you choose between going to a large, research-oriented school and going to a smaller school, you're essentially making a trade-off between resources and personal attention. Bigger schools have more and deeper resources, but it can be tough for undergrads to have much significant interaction with professors, particularly in the first year or two. Smaller schools may not offer the same variety of courses, or get huge
      • A $2,000 Macintosh system will run just as well as a $900 self-built x86 system.

        Depends what you mean by "as well as"... That only applies if you aren't talking about heat output, power requirements, cooling required, decent case design, ease of servicing. Then, for the programs you are using, things like an extra-fast bus, large CPU cache, and posibility of huge ammounts of RAM, must not be important at all to you.

        So, sure, if those 8 things are not to be considered at all, then sure, you can say that

      • That argument is mute. Motorolla fucked Apple over bigtime. The G4 can not handle ddram ram. THe chipset slows the speed down to 133 sdram because the cpu can not handle anything faster. It created a huge bottleneck.

        The g5 is the IBM power4 in a lighter configuration. ITs the fastest desktop level cpu.

        You could probably build a dual smp pIV which will perform close but the G5 has better fpu's and is cheaper in an smp system then xeon based top of the line PIV. This is why they chose the G5.

        Also I do not