SpaceRay
![]() |
I remember that I have seen that some of you have a Mac Pro computer and with dual CPU, like Totte that have one, but I think it was with 8 cores.
Please, anyone have Mac Pro 12 cores to make a filter forge benchmark test? ALSO would be the same if anyone have a Dual Xeon 6 cores (12 cores) 2 Ghz or 2.2 GHz on Windows as it would be the same ---------------------------------------------------------- ---------------------------------------------------------- ---------------------------------------------------------- I want just to ask a very simple render of a 3000 x 3000 and 4000 x 4000 render time of some filter that you may like and have and that is not a photo filter, better if it was a FF self generated image but also can be other filter. I just want to know how much time does it take to render at this resolution on a 12 core computer. ---------------------------------------------------------- ---------------------------------------------------------- ---------------------------------------------------------- As on the internet there is NO reference in any web about any benchmark made with filter forge, I want to know the difference in perfomance from a dual cpu 6 cores. I know that the cores alone are not important, and what is really important is the perfomance and real world speed. I know that the MacPro 12 cores are based on two 2.4Ghz Intel Xeon E5645 . NOTE . I have made a copy and backup of this post it is lost as it happens lately . |
|||||||
Posted: November 18, 2012 9:20 am | ||||||||
Totte
![]() |
I have
![]() Just don't have FF installed on that box. - I never expected the Spanish inquisition |
|||||||
Posted: November 19, 2012 2:01 pm | ||||||||
Totte
![]() |
I have FF in it soon too.... Will test with some of my filters, can give you 12 Core plus 8 Core, with default AA settings. I'll be back!
- I never expected the Spanish inquisition |
|||||||
Posted: November 19, 2012 2:07 pm | ||||||||
Totte
![]() |
Baah! The "Slow elapsed time" seems to be buggy in 3.0. When I render a 4000x4000 to disk, the elapsed time shows the time it took to render the preview after the render was done, will try to test with disabled "Render smaller previews" to see if it helps.
- I never expected the Spanish inquisition |
|||||||
Posted: November 20, 2012 9:58 am | ||||||||
Totte
![]() |
used: http://www.filterforge.com/filters/9225.html
First preset, 4000x4000 11 minutes sharp. - I never expected the Spanish inquisition |
|||||||
Posted: November 20, 2012 10:14 am | ||||||||
SpaceRay
![]() |
WOW! Thanks very much Totte for your help and for making the test and making the benchmark with your bricks wall filter, it´s very appreciated BUT perhaps there may be something wrong.
You have said that the time taken for a 4000x4000 with the first preset is 11 minutes and this is with an 8 core computer ? I mean a 8 core (16 threads?) or is it an Quad core (8 threads) ? I think it should be with default settings too, Antialias 5 I am telling this and asking because when I have made my own benchmark on my own computer with a Quad Core i7 2600 3.4Ghz I got the time of 10 minutes 41 seconds (made it 3 times and always the same) How is possible that I have less time than you? ![]() ![]() ![]() Here below is the screenshot as a proof of what I got and to see if you have got the same as me. ![]() |
|||||||
Posted: November 20, 2012 10:31 pm | ||||||||
SpaceRay
![]() |
This happens only when you have the "reduced" preview activated instead of the "Actual" preview, so whatever size you have put it will always show you the time of the 600x600 render preview. So to know the real time of the 4000x4000 render you must put the "Actual" size preview. |
|||||||
Posted: November 20, 2012 11:32 pm | ||||||||
Skybase
![]() |
It's possible because there could have been "other background things" running while the render happens. The variability of render times can be seen when, for example, when the system runs Time Machine, a backup utility. And you really don't know what other operations are running until (least for us Mac users) take a peek at the activity monitor or via terminal.
However, in this case the time marks are relatively close to call insignificance. 11 minutes vs 10 mins and 41 seconds isn't a huge difference between two systems. If you run this test 10 more times with 10 different presets based on the "next variation" button you'll see a fair difference between numbers but not in terms of the overall rendering time. |
|||||||
Posted: November 21, 2012 2:51 am | ||||||||
SpaceRay
![]() |
Skybase, I do not know if you have seen that my computer is ONE CPU Quad core and the one fr om Totte is TWO CPU quad core (or six core) although is very important that they may have different GHz value so it will not be probably double speed.
I mean that I think is not possible that my Quad Core 3.4GHz is FASTER than a his MacPro 8 core computer even if it has a lower GHz value. WHY I AM ASKING THIS HERE? I have seen that having more cores DOES NOT mean that it will be a better and much faster computer, and directly computable and multiplying perfomance. I have seen some cinebench benchmarks wh ere a Quad core (8 core threads) like mine has a 7.5 score AND a Dual Xeon 12 core (24 core threads) has a score of 14.5, it should be triple the score instead BUT it is not because the Dual Xeon have each 2.2 Ghz instead of 3.4 Ghz like the Quad Core. |
|||||||
Posted: November 21, 2012 5:32 am | ||||||||
Totte
![]() |
That was the twin 6 core which means 24 threads.
It also depends on how well the render engine is. I have an example: On my laptop dual-core 2.8GHz Core i7 mobile (4 threads), a DAZ Studio render took 15 minutes and 45seconds. The same scene on my 8 Core machine took 1 minute and 12 seconds. I know that i7 mobile is painful as soon as you start to use it it will put two hyperthreads (one on each core) to 15% speed only as the CPU will fry otherwise. I was also running on mac OS X 10.8 as that is what I have on that machine, might affect things as I don't know how well FF 3 works on 10.8 (crashes frequently at least) - I never expected the Spanish inquisition |
|||||||
Posted: November 21, 2012 11:07 pm | ||||||||
Totte
![]() |
And about cores, it depends on the task and how well the code handles many cores.
If the code uses a lot of signals to synchronize the threads, about 4 or 5 is a break-even when each added core will only add marginally as it will steal so much computing time by adding more and complex synchronizations. I've been programming multithreaded and multicore for almost 15 years so I know most of the traps you can step into. - I never expected the Spanish inquisition |
|||||||
Posted: November 21, 2012 11:10 pm | ||||||||
SpaceRay
![]() |
![]() WOW! Is this really true that the 11 minutes render is made ON A 12 CORE (24 threads) COMPUTER?????? ![]() ![]() ![]() ![]() ![]() ![]() ![]() How is possible that my quad core can be faster than a 12 core ? What is the Ghz value of each of the 6 cores chips? It should be at least 2 Ghz.
YES; is true that the perfomance and speed of a software on multicore chips depends totally on how well the program is made and how can it is optimized to benefit from having more cores and CPU power. Does this mean that the Filter Forge render engine does not work well with lots of cores (or perhaps dual cpu chips) and is worse than a simple with much less cores? SO having more cores will make FF even slower? ![]() MAC vs Windows Also there is something that it can be important when comparing, and is that these are both different computers with two different Operative Systems, so this also must be taken into account, and I do not mean that MacOS may be slower than Windows. |
|||||||
Posted: November 22, 2012 4:58 am | ||||||||
Totte
![]() |
They are 2.4GHz CPU's.
I was using FF3. Tried again after restart 11 minutes sharp. Made a 3000x3000: 6mins 32 seconds - I never expected the Spanish inquisition |
|||||||
Posted: November 22, 2012 11:52 am | ||||||||
SpaceRay
![]() |
Thanks for confirming that you have the same MacPro that I have seen advertised.
You do not have to repeat the test, as I have also repetead and will be always the same, and is NOT your fault and of course I believe you and do not have a doubt that you have done it right. BUT what I still can understand is HOW is possible the following ---------------------------------------------- ---------------------------------------------- TOTTE´S COMPUTER Dual CPU 2.4Ghz Intel Xeon E5645 processors with 6 cores each = 12 cores (24 threads) MacOS Operative System Resolution 4000 x 4000 -- Render time result = 11 minutes Resolution 3000 x 3000 -- Render time result = 6 minutes 35 seconds ---------------------------------------------- ---------------------------------------------- MY COMPUTER Only one CPU 3.4Ghz Intel i7 2600K with 4 cores (8 threads) Windows 7 Operative System Resolution 4000 x 4000 -- Render time result = 10 minutes 41 seconds Resolution 3000 x 3000 -- Render time result = 5 minutes 8 seconds ---------------------------------------------- ---------------------------------------------- HOW CAN 12 CORES BE SLOWER THAN 4 CORES? Please can anyone explain HOW is possible than a dual CPU 12 core can be SLOWER than one CPU with 4 cores ?? ![]() ![]() ![]() Of course that I know that they are not having the same Ghz value and the Quad core is faster, BUT on the other hand there is TWO 2.4GHz chips and not one, so it should be surely faster, AND the architecture and hardware of the Xeon processors is better than the i7 series in this case. |
|||||||
Posted: November 22, 2012 8:49 pm | ||||||||
Casual Pixels
![]()
Posts: 96 |
Xeons are optimized for quite different behaviour profiles than the i7. Xeons are made for server situations, and are not specialized for performing a single coordinated task. This relates not only to the CPU (the optimizations there are not so different) but the different Lx caches to which the CPU has access as well as the strategy for reacting to page faults within those caches.
You might well think that that wouldn't make a difference, or not much anyway. Memory is memory. A general rule of thumb, though, is that the ratio of speed difference between Disk Access and RAM access is roughly the same as the ratio between RAM and L2 and the difference between L2 and L1. So multiple orders of magnitude if the work being performed isn't well suited to the caching strategy. So depending on how the data is localized in RAM, it might be really efficient for the CPU to work on it, while inefficient for another CPU. And if it's not well designed, there can be a huge amount of thrash in server memory no matter which CPU is asked to chew on it. Just a guess, mind. There might be something else entirely at play. |
|||||||
Posted: November 23, 2012 4:37 am | ||||||||
GMM
Moderator
Posts: 3491 |
+1 to Gilles and please keep in mind any possible differences between PC and Mac. For a more precise test you need to perform your benchmark on the same OS.
|
|||||||
Posted: November 23, 2012 12:25 pm | ||||||||
SpaceRay
![]() |
Gilles you are right that Intel Xeon Processors are optimized and made in a different way for a different market and users, and it would not be good for normal users or for gamers, and they are made specially for servers.
And they perform tasks differently and have different kind and also the inner workings and how is joined all the parts are also better done for a higher perfomance and efficiency. BUT this is why these Xeon Chips are ALSO used for High End and top Business Workstations where a higher perfomance and speed is needed and more efficiency than the desktop chips can offer, AND you can´t have more than one desktop chip in a computer, BUT you can have 2 or 4 Xeon chips in the same motherboard, or join many special twin motherboards with 2 Xeon to make more complex configurations. Gilles, I think that what you say is that it depends on the relation, combinationa and communication of hardware, the operative system and how optimized is the software to be able to benefit and get the most from a multi core AND multi chip computer, and is true that this is very important. COMPARING DIFFERENT COMPUTERS WITH DIFFERENT OS
GMM, yes, you are right that always any kind of speed test or benchmark should be made in the most similar way and be able to compare the computer with the most similar components as this will give much better results to compare more precise and realistically. I would love to find someone that could have a Windows computer with a dual chip Xeon AND that uses Filter Forge to test it better, BUT regretably I have not found it yet, and I have been searching much through internet, but until now nothing ![]() ![]() BUT I still do not understand and can´t explain HOW a computer that is supposed to be a very powerful and fast computer like the MacPro with TWO Chips with 6 cores each (12 cores) at 2,4 GHz each can be SLOWER than just ONE 4 cores chip 3,4 GHz. |
|||||||
Posted: November 23, 2012 2:28 pm | ||||||||
Casual Pixels
![]()
Posts: 96 |
What Xeons are best at are doing many independent tasks at the same time. They are optimized for that purpose. The i7, on the other hand, is optimized for performing one big task across processors. Again, merely measuring how many cycles you get (cores * speed) in one situation vs another across chip architectures is a massive oversimplification of the overall throughput you'll get from a chip. Bottom line is that there are all sorts of processes involved (page fault behaviour, instruction optimizations, float vs int) whose impact absolutely swamp number of cycles in terms of how fast some calculation will take place. Finding scenarios like the one you describe is not in any way surprising. |
|||||||
Posted: November 23, 2012 6:14 pm | ||||||||
SpaceRay
![]() |
I agree with you Gilles D, I will make a research and investigation to find the real truth behind this and HOW works the i7 chips and Xeons and Itaniums, and what is the real difference between them and which one to choose (well surely Itanium not
![]() ![]() ![]() |
|||||||
Posted: November 24, 2012 7:26 pm | ||||||||
SpaceRay
![]() |
After searching for the needed information between the benefit of having a dual cpu vs single cpu I have found some interesting things, it seems that:
It is better to have an ONE overclocked i7 3930K at 4 or 4.5 GHz with 6 cores (12 cores HT) than having TWO Xeon 2620 2 Ghz (4 Ghz both) with 12 cores (24 cores HT) if you are not using heavily 3D rendering on powerful softwares. Also the case for the motherboard must be an EEB case that is more expensive. Here I want to copy one of the informations I have found that I think is interesting information from Tom´s hardware.co.uk forum
|
|||||||
Posted: December 14, 2012 3:33 pm |
Filter Forge has a thriving, vibrant, knowledgeable user community. Feel free to join us and have fun!
33,711 Registered Users
+18 new in 30 days!
153,531 Posts
+39 new in 30 days!
15,347 Topics
+72 new in year!
24 unregistered users.