New Power Macs Have Crippled DDR Memory? 82
eggboard writes "According to Rob Art Morgan, who has tested this, the new Power Macs from Apple that use DDR (double data rate) memory -- like the Xserve rank-mount unit -- cannot access the memory any faster than the cheaper and slower SDRAM found in the previous system arch. A controller limits the data rate to 1 GB/s, while DDR could work more than twice as fast. Unfortunately, this makes mincemeat of the architecture, as it bus-/memory-bounds 2D and 3D graphics and rendering."
More information (Score:4, Informative)
It's Motorola, folks... (Score:4, Interesting)
The closest Moto has gotten is a 8xxx series "G5" processor that supports a RapidIO interconnect. However, this new processor, despite the existence of demo units dating back years, is still effectively vaporware. My understanding is that Apple is backing an interconnect technology called HyperTransport instead.
Any insiders willing to clarify or correct this? Motorola's current financial state is distressing, especially since I live near where they are based. All those layoffs...
Re:It's Motorola, folks... (Score:3, Insightful)
Just a roumour thogh. File it away in the round-file.
Re:It's Motorola, folks... (Score:1)
Re:It's Motorola, folks... (Score:2)
The clones were older powerpc based - however they we all going to transition to G3 sooner rather than later, some even went so far as to inclue a G3 daughter card inorder to get around Apples obnoxious legal department. here for more info
Not only that, but MOT had it's own Mac clone that got squashed - though it was a small endevour compaired to the roumoured G3 ramp up.
more info. [wired.com]
according to the article, this move my apple cost MOT $95 million, who knows how much monet MOT wased on G3?
Re:It's Motorola, folks... (Score:4, Informative)
Besides, how many G3's would MOTO have sold if Apple went bankrupt? If cloning had continued, they would be gone by now.
Re:It's Motorola, folks... (Score:2)
Yes and before that the whole company ran on Macs (both Apple and Mot)... after the clone fiasco they started switching to PCs.
Re:It's Motorola, folks... (Score:2)
Motorola already has a G5 out... the MPC8560 : PowerQUICC III Integrated Communications Processor
It's an embeded processor, not very fast either, 600 MHz - 1 GHz, but this is of interest:
Re:Meet the new boss... (Score:5, Insightful)
Motorola makes the CPU. Motorola makes CPUs that don't support DDR access to memory.
Apple makes the motherboards. Apple implements DDR memory everywhere except in the CPU.
IBM just announced vector computing support in the Power4. Vector computing is a big part of Apple's strategy, and until recently it was only available from Motorola.
The new Power Macs have a heat dissipation mechanism that's capable of dealing with many times the heat load of the currently shipping systems. They added this feature despite the fact that the previous generation of Power Macs had no particular heat problems.
Can't you read the writing on the wall? Apple has designed this new Power Mac to accept new, faster processors, and lots of 'em. A four-processor system is not unreasonable given the amount of space and heat dissipation inside this new chassis. Six or eight processors might even be possible, if everything comes together just right.
Don't assume you have the first idea what Apple has planned until you get all the facts.
Re:Meet the new boss... (Score:1)
The difference is this:
1. Everything except the tasks that are limited by CPU throughput will probably be slightly faster.
2. When new CPU's come out, you will probably be able to slap them in and gain full benifit from them.
But if you really think there's no difference, buy a previous-generation G4 from somebody on eBay and you will get a great price for a machine you believe to be just as good.
Re:Meet the new boss... (Score:1, Insightful)
Yeah sure, you can get cpu upgrades from other people, assuming they aren't out of business (XLR8), selling overclocked parts as the real deal (powerlogix), or can actually get G4's > 500 MHz (for the longest time only apple even HAD parts > 500 MHz).
Apple has fucked up royally. They figured duct-taping in the DDR would fool everyone into thinking the machines had gotten a speed boost, when the only boost is the addition of the 1.25GHz unit.
Re:Meet the new boss... (Score:1)
No, they also gain the boost to AGP performance that DDR brings. Even if the chip is not ready for it, putting in DDR was a good move. Or perhaps you would have preferred they stayed with PC133?
Re:Meet the new boss... (Score:1)
Actually it was Sonnet who was involved in the "overclock fiasco" which wasn't actually a fiasco at all since the chips were rated higher from IBM in the first place but oh well...you can't stop the machine once it gets rolling now can you?
Re:Meet the new boss... (Score:2)
Re:Meet the new boss... (Score:1)
Re:Meet the new boss... (Score:2, Informative)
Two apparent reasons:
(1) Apple wasn't selling near the PowerMacs that they were a year ago. The line needed something shiny and new.
(2) Apple was probably designing this new motherboard for a CPU that can handle HyperTransport (hopefully the next PowerMac) and decided to release it now for reason (1).
That's the way I see it, anyway.
Re:Meet the new boss... (Score:2)
Re:Meet the new boss... (Score:1)
Re:Meet the new boss... (Score:2)
The real story as I understand it is quite an old one - Apple is screwed because of the continued dependence on Motorola. You see, the G4 processors made by Motorola can't connect directly to DDR memory, requiring this type of kludgy go-between.
I for one am all for seeing Apple abandon Motorola within the next couple of years...and PPC in general if need be...and move to cheaper comodity CPU parts by Intel or AMD. I suspect we could kiss goodbye the tiny, quiet, cool (temperature) designs...but it probably will be worth it.
Re:Meet the new boss... (Score:2, Informative)
Oh, please! Be fair, the machines didn't just "not go up in price." They went down in price! The new dual 1GHz machines now sell at the mid-level price, several hundred dollars less than the old 1GHz. Even if the performance was basically unchanged across the board (which I don't beleive the Bare Feats benchmark proves anyway) the price/performance ratio has definitely improved.
- Dennis D.
Go read the Ars Technica forms (Score:4, Informative)
before drawing any conclusions from this article.
Motorola (Score:1)
Maybe this move is hoping partly to twist Motorola's arm that this is how it's gonna be with RAM so they should get their act together?
I actually just hope that Apple ditches Motorola altogether and lets IBM do their thing.
Re:Motorola (Score:1)
It's a shame, though, that they didn't use plain sdram and make the system cheaper... unless, there's something in the works for a plug-in replacement cpu supporting faster memory througput?
Re:Motorola (Score:2, Insightful)
Then if that falls through somehow, they'll be ready to move to Intel.
Just my feeling on it.
Re:Motorola (Score:2)
If apple went with intel, they would piss off the rabid fans, they would piss off the PC comunity, they would piss off the anti intel people and they would cause sever issues for new users.
But that' just me talking
Re:Motorola (Score:2)
Haven't heard any rumors about that though, surprisingly.
Re:Motorola (Score:1)
Re:Motorola (Score:1)
If I may do a bit of rumourmongering, I'd like to say that we may expect a quad-processor G5, but with the G5 running at, say, 800 MHz. I mean, we may not have superhigh frequencies on our processors (or multiplier values such as 21.5), but you can make sure we'll have the most processors we can get in there!
Server? (Score:5, Funny)
These so-called benchmarks (Score:4, Insightful)
The base configs of each machine are NOT listed.
The base OS configs of each machine are NOT listed.
The combined running configs, ie, size of objects, optional software (especially 3rd party apps and gui-players), etc, etc.
Guess what - each of the above - without running a single line or click of a benchmark can help in determining the outcome.
I'll wait to see how bad or good the new machines are - but I can tell you in advance, the old dual 1Ghz machines and the new ones are not identical at all in the CPU area.
Some folks have to learn to read and understand specs before jumping up and down and screamming.
Just my 2 cents, from the peanut gallery here in NY
Gil
Re:Well, it WAS a benchmark of an Apple machine (Score:2)
Re:These so-called benchmarks (Score:2)
CPU Bad, DMA and AGP Good (Score:4, Informative)
Re:CPU Bad, DMA and AGP Good (Score:1)
Graphics/rendering bus-limiting remains untested (Score:4, Insightful)
The data rate between CPU and RAM is limited to 1.3 GB/s. However there is still more than 1.3 GB/s of bandwidth for the GPU (AGP 4x which goes at about 0.5 GB/s), DMA calls from hard disks, etc. So graphics and rendering are not strictly bus-limited, as the GPU can never fully stress the bus. Furthermore, the GPU wasn't tested in the BareFeats benchmarks!
Furthermore, don't forget that the L3 cache on the new 1GHz Macs is only 1 MB, not 2 MB as it was in the previous 1GHz Macs (and as it remains in the 1.25 GHz Macs).
All these benchmarks teach us is that CPU-limited tasks like those posted at BareFeats are not a good test of the added throughput between the system controller and RAM. We need to see benchmarks that stress all of the throughput, not just the portion between CPU, controller, and RAM.
Re:Graphics/rendering bus-limiting remains unteste (Score:2)
What I haven't heard anyone talking about is some of the groundwork laid out for later, when they can remove the CPU bottleneck. Some of the more interesting features of the Xserve architechture are: Intervention [apple.com] and Write Combining [apple.com]. Funny, the things revealed by a little research...
I'll keep my QuickSilver 933 for now. Jaguar promises good performance gains, and that's worth the $129 if I save about one hour for one client. (or worth about $500 if Jag gives me an extra hour of quoteunquote free time)
Re:Graphics/rendering bus-limiting remains unteste (Score:2, Insightful)
Anyway, the thing that stands out about these benchmarks is that the new dual 1GHz has a system bus (and memory bandwidth) that is 25% faster than the old one, yet this made no discernable difference in these particular benchmarks. This isn't surprising considering that these are CPU-intensive tests, but the bizarre thing is the number of people who are claiming that this somehow proves that the DDR implementation is useless, a fraud, etc. etc. They seem to think these benchmarks would improve dramatically if the DDR bandwidth was passed on to the CPUs.
This isn't logical. Why would a real 25% improvement in memory bandwidth have no influence at all on the benchmarks, yet a 100% improvement which would come from a "real DDR" implementation suddenly make a big difference.
The results of these particular benchmarks would be the same because they are CPU intensive tasks and therfore are bound by CPU speed. (That is, for these tasks much more time is spent in CPU processing that in reading or writing memory). There's no magic change Apple could make in other parts of the system that will make 1GHz CPUs process faster than 1GHz.
That isn't to say that the 166MHz system bus and the DDR implementation Apple is using isn't advantageous in generaly system usage, it is just to say that these benchmarks will not reveal those advantages.
- Dennis D.
Misinformation at it's finest (Score:5, Informative)
Yes the new motherboards are not full DDR, this is mainly because the processors available from Motorola cannot handle DDR FSB's and therefore a full DDR motherboard. This is a shame, but it is far from crippled DDR RAM. Many early DDR RAM x86 motherboards were the same, only the RAM was DDR, not the full motherboard and processor FSB. While this does mean there is still a bottleneck (in certain tasks) between the processor and other components there are advantages to having DDR RAM. The tests at barefeats are using purely CPU limited operations, which will obviously show no real improvement as there has been no CPU or bus change (although the new 1Ghz procs have only 1Mb of DDR L3 cache versus the old 2Mb DDR and a 167Mhz bus version is available). What DDR RAM will help with is when there are a variety of components (CPU, HD, network, AGP, PCI, Firewire, etc) all vying for valuable memory bandwidth. It's these 'real-world' situations when we will see a performance increase. If you just run single process, purely CPU intensive tasks then maybe these machines aren't for you, but if you run a lot of stuff at the same time, or anything that uses CPU, HD, AGP etc intensively and concurrently then you should see an improvement. Things like Quartz Extreme will be throwing a LOT of data at the AGP bus, with DDR RAM this won't have to wait it's turn while say your CPU is busy grabbing all the bandwidth. I'd say many users have a lot of HD, CPU, GPU and network activity going on simultaneously, especially 'power' users. Hopefully we'll see some more benchmarks that show a variety of tasks being performed on these new machines once more people (and more reputable sites) get hold of the machines. While not fulfilling everyones dreams, I'm sure that the statements about the DDR RAM additions being a "waste" or "crippled" will be shown to be entirely false.
Re:Misinformation at it's finest (Score:2, Insightful)
I've seen a lot of comments on various web sites about the new 1Ghz bemoaning its lack of a DDR front side bus. Though I too am a little disappointed I think everyone has got sucked into the Apple marketing distortion field. I'm also disappointed to see comments on a few sites saying "clearly" the dual CPUs in the "Wind Tunnel" G4s are FSB bandwidth limited.
These claims require proof and the proof just isn't there. These best counter argument I've seen so far was a comment on xlr8yourmac.com
http://xlr8yourmac.com/archives/aug02/081402.ht
where a user reported:
1) A quick check shows it to be 3 (and bit) times the performance of my 667 Mhz G4 system (7450 processor). It scaled linearly (e.g. 2 * (1000/667)) despite the improved memory system. [BTW - the new dual 1GHz has 1MB DDR L3 per CPU, vs 2MB DDR L3 cache per CPU with the dual 1GHz Quicksilver model] The FSB is clearly SDR from the documentation and performance. The memory system is DDR. I need to run more tests.
Hmm that "clearly" word again.
Well the 7455 bus is still SDR but the thing to note in this report is the "performance scaled linearly" with clock speed. As both machines use similar CPUs (7450 in one and 7455 the newer there are no large changes in the CPU design) the conclusion we draw from this is that the CPU is *not* memory I/O bound (i.e. FSB bandwidth bound). If it was the increase in performance would be less than 2 * (1000/667) times. So running both CPUs flat out doesn't saturate the memory bus (and all the usual other traffic is kept off the internal bus by the IO controller if it moved by DMA transfers).
It also implies that for most applications (the tester doesn't describe the tests they used so whatever test they used) the 1Mbyte per CPU cache is sufficient and its loss
I'd like to see more measurements done to confirm this hypothesis but it looks like magically speeding up the bus won't cause the CPU through to improve dramatically. The way to do this (if anyone has a chance is to run some CPU and memory bound applications on the all three models (and try to correct for the different cache sizes and FSB speeds) but if you see close the linear relationship then the CPUs are certainly not held back by the front side bus.
And h0tblack has a point that DMA will get more of a workout with Quartz Extreme though I suspect it will be less than he expects (most of the stuff should be in the GPU VRAM for compositing and anything that gets there will have to be worked on by the CPU to some extent at least once).
Re:Misinformation at it's finest (Score:5, Informative)
It is really really hard to keep a dual 1Ghz machine fed when a single instruction (taking a single cycle) can process 16bytes of information.
If you had a simple filter for example, a blur.. that could be executed in perhaps 10 cycles...
which would require 3.2GB/s of bandwidth to run at full speed (1.6GB in, 1.6GB out), and on a dual... 6.4GB/s (which happens to be the bandwidth on that new IBM PowerPC
The current bus can only provide 1.3GB/s
Which means this filter would run at 40% of the full speed...
If its running on two cpus, then its going to run at 20% of full speed.
This means DDRing the bus would double the performance... but you'd still only be running at 40% of full speed.
AltiVec generally converts almost any relatively complex operation into a memory benchmark.
Since altivec is used for the most time critical parts of a program, faster memory would allow these time critical parts to run x times faster...
Anywho, when it takes only two cycles to multiply 16 values by another 16 values, then add another 16 values, and saturate the result (something which would maybe take 80 cycles without altivec, memory bandwidth becomes the limiting factor. (for those counting that's a 40x improvement, the equivalent of a 40GHz chip if it was running scalar code)
Its even worse on 7450s because the AV unit can execute multiple instructions concurrently.
G4s *ARE* memory constrained, I'd say even seriously.
Small benchmarks will not expose this as they'll almost always run out of L3, or even L2 (L1!) cache.
BUT real world operations normally work on massive data sets...
(be it video, audio, 2D, 3D, genetic sequences, or just your window being composited with a menu)
Incidentally, the speed improvements from altivec can generally be worked out as 4x, 8x, or 16x for most uses depending if you can use 8, 16 or 32bit math. Some operations can make use of tricks altivec can do and scalar can't. which allows speeds of 32x (or even more)
Running a highly optimized calculation which is NOT memory bound we've managed to come up with some interesting numbers
The algorithm was highly optimized for MMX and AltiVec,
running on a single G4/500, with many other applications running etc, the calculation was over twice as fast, as the same calculation on an athlon 1.3Ghz. The G4 has a 100Mhz bus, the athlon has DDR266, but it doesn't matter because the process is not memory bound.
this means it took 15 mins on the G4/500 and 30mins on the athlon/1300.
(the athlon was running NOTHING else)
Re:Misinformation, Lies and Statistics. (Score:2)
Re:Misinformation, Lies and Statistics. (Score:2)
And for what it's worth, Slashdot's article selection has been nothing, if not pro-Apple.
Re:Misinformation, Lies and Statistics. (Score:2)
So, of course, pointing out that the article is a troll and flamebait is the equivilent of trolling and flame baiting and my post has been moderated as such.
Yeah, Slashdot is really pro-apple.
Sheesh.
There is no science here, no rationality, no technology. And its sad. There should be a place for geeks to congregate online.
Not to Worry. (Score:2)
In any case, the new systems are still a great buy. It's a UNIX box, folks. More processors mean more processes. At least the systems aren't SLOWER. I take the benchmarks from Bare Feats with a grain of salts. As the saying goes, your mileage may vary. I'm betting these systems will rock when the Mac version of Jedi Knight II shows up.
Can someone please explain (Score:1)
Re:Can someone please explain (Score:5, Informative)
Unfortunately the AIM alliance partners seem to have increasingly divergent needs from the partnership. IBM wanted PowerPC for servers, and sees Altivec as a gaudy tack-on to their architecture. They still produce the "G3" chips, at ever higher and higher clock speeds. Apple can't use them though because of the MHz. myth. IBM's stance towards Altivec appears to be weakening though with their upcoming chip.
Motorola wants PowerPC for embedded stuff, and Altivec makes it easy to do DSP like functions in a general purpose processor.
Apple needs the PowerPC for everything but the iPod. They need Altivec to make MacOS X so cool for consumers and scientists. Since IBM won't license it, they are stuck with the only producer, Motorola.
It's times like these that I wish there were some truth to the old rumors about Apple buying Motorola's PowerPC fabs. If that were the case, Apple could produce the exact chips that Apple needs, not what IBM or Moto wants. Unfortunately, there isn't any indication that this would be profitable or feasible for them.
-- Len
Re:Can someone please explain (Score:1)
Re:Can someone please explain (Score:3, Informative)
Wrong, wrong, wrong (Score:5, Informative)
It is clear that Motorola and Apple have grown apart. Apple has had big problems with them for a long time and has looked at other options. They didn't choose to buy Motorola's awful fabs, it's too late to do that know (nor is it smart). No, credible rumors point to IBM. It makes a lot of sense:
- IBM wants to sell more low-range (Linux) servers, so they already need a fast desktop CPU. Why not sell it to someone else as well?
- Apple has a lot of experience with Altivec, it makes sense to work with them to produce this chip (Apple employs some very smart chip-designers).
- Altivec is a respected instruction set. It's proven to work (no need to reinvent the wheel on a risky venture). Tools are available. GCC supports it (and since their servers will run Linux...).
In 6-9 months we'll probably have a 64 bit PowerMac that is very competitive. I can't wait.
Re:Wrong, wrong, wrong (Score:1)
I'm sorry, but I think that this is a widely optimistic time frame. The chip won't even be discussed in a conference for another two months, and you expect it to be shipping in a Mac remade for 64 bit 4-7 months after that? I don't think it will be happening any time next year.
Re:Wrong, wrong, wrong (Score:1)
Re:Can someone please explain (Score:1, Flamebait)
By "MHz myth", you must be referring to the fact [heise.de] that G4's are not performing much faster than a Pentium of similar clock speed.
Apple makes nice machines and software, but they really do need to get their act together on performance.
Re:Can someone please explain (Score:2)
I don't think it's as bad as it looks (Score:1)
The benchmarks are also poor. They appear to mainly be CPU dependent, not memory bandwidth dependent.
Not so, here is the real truth (Score:1)
The six-color logo was inspired by a series of print posters make for Ford Motor company, think late seventies design.
How do I know this, I used to work MIS there and we had piles of old Macs, including lots of the really early models. Alas, they no longer do consulting to Apple, nor do they use Macintoshes anymore. Held out until 1998 though, so don't bag on them too much.
CPU intensive benchmarks... (Score:1)
bogus benchmarks (Score:2)
The only major manufacturer that seems to be missing official SPEC results is Apple. Instead, we get bogus and irreproducible benchmarks like Photoshop and Bryce, both from Apple and from benchmark sites like these.
Why? Is Apple afraid of backing up their claims of "supercomputer performance" with actual facts? Inofficial SPEC benchmarks have shown the G4 not to be all that much faster than a Pentium with similar clock speed.
Re:bogus benchmarks (Score:1)
A G4s integer performance is acceptable but it's floating point unit is sadly lacking. Altivec is the only reason that G4's scream at floating point math. With a vector processing unit that handles floats like that you don't need a really strong general purpose floating point unit. The only annoying thing missing from Altivec is support for double-precision floats.
If GCC were made to auto vectorise code for Altivec then the Spec benchmark results on the G4 would increase dramatically. Then you might see something closer to the real picture and they might be worth Apple publishing.
Quality of benchmarks? (Score:2)
Does anyone know if these benchmarks test memory throughput at all? I have no idea what the Photoshop "MP" action file test is, or why it's there twice, so I'm mainly curious about that. I can't imagine that MP3 encoding or rendering in Bryce aren't completely CPU-bound.