Thickness, Thermals and Core: Understanding how Surface Pro 3 Got so Thin

Ever since Surface RT showed up I've wanted Surface Pro but in the RT chassis. Microsoft seemed to consciously avoid any lower TDP Core processors in favor of Ultrabook SKUs, which pretty much kept Surface Pro thicker than its ARM-based counterpart. Surface Pro 3 changes everything with a rethinking of how Microsoft embraced the thermal constraints of a tablet.

Surface Pro 3 is available with one of three CPU options: a Core i3, Core i5 or Core i7. The latter two are 15W TDP parts, while the former is a 11.5W TDP/6W SDP Haswell Y SKU. Haswell Y is really designed for passively cooled form factors, but all three CPU options receive the same heatpipe and fan cooling system offered by the Surface Pro 3. In short, the Core i3 model should behave a lot more like a standard Ultrabook part (sans Turbo).


Surface Pro 2 (left) vs. Surface Pro 3 (right)


Apple iPad Air (left) vs. Surface Pro 3 (right)

Microsoft's claim to fame is the ability to build the world's thinnest device with a Core series CPU inside. At 9.1mm thick, the Surface Pro 3 is just barely thicker than last year's Surface 2 but with a full blown dual-core Haswell instead instead of a high-end phone/tablet ARM SoC. As both the Surface Pro 2 and Surface Pro 3 used 15W Haswell Ultrabook parts and Microsoft claims a performance increase over last year's model, the question is how Microsoft can reduce thickness, keep TDP the same and increase performance. The answer is simply a difference in what's acceptable from a thermal standpoint.

Here's a look at the original Surface Pro while running an hour long loop of 3DMark Ice Storm Unlimited. I chose this test in particular as it behaves as a nice balance between CPU and GPU workloads on the device:

You can almost trace out the path of the heat pipe carrying the thermal load away from the CPU and to the two fans in the system. I measured peak temperature here at 41.7C.
Now let's look at Surface Pro 2:

Despite a move to Haswell, Microsoft lets the device get much warmer. You can make out the same U-shape thermal distribution as heat is carried away using the two fans, but the temperatures are much higher. I measured a maximum temperature running the same workload of 47.3C.

Here we have Surface Pro 3:

Right away you can see the new cooling system at work. The thermal load is mostly confined to the left side of the image (we are looking at the back with the kickstand unfolded towards the camera, so if you were looking at the display we're talking about the upper right side of the machine). The cooling solution is far more compact and I suspect ready for a move to Core M (Broadwell). Peak temps however are much closer to Surface Pro 1 at 43.2C.

The thermal story points us in the right direction. Either Surface Pro 3's fan and heatpipe configuration is able to remove heat far better than Surface Pro 2's design could, or the CPU in SP2 doesn't get as warm. I suspect it's the latter.

For starters, I'm guessing that Intel is helping Microsoft with delivering better binned Haswell ULT and Y series SKUs. But the big change is I believe Microsoft is more aggressive about reducing CPU and GPU frequencies in Surface Pro 3 compared to Surface Pro 2. Benchmarks will show an increase in performance due to more aggressive ramping up/down of clock speeds vs. Surface Pro 2, but prolonged load cases will likely show a decrease in performance vs. last year's model.

I first stumbled upon this behavior while trying to gather thermal data for Surface Pro 3. I noticed large run to run variance if I repeatedly ran 3DMark 11. I'm used to seeing this sort of behavior on smartphones that throttle quickly, but it was unique for a Surface Pro device.

3DMark is a synthetic test so the real question was how would Surface Pro 3 perform in a real world scenario where sustained CPU/GPU load was guaranteed for a long period of time. I figured a game playable on the machine like Dota 2 would be a great example. I asked our own Ryan Smith to whip up a custom benchmark using the game and I ran it on Surface Pro 1, 2, 3 as well as a 13-inch MacBook Air (Early 2014). The graph below illustrates average frame rate during our Dota 2 benchmark for all of the systems:
As you can see, even with Surface Pro 3's fan running the platform doesn't deliver sustained performance equal to last year's model. It's an understandable tradeoff given the substantial reduction in device thickness (and thus improvement in usability), but it's important to note nonetheless.

Dota 2 Performance - ATDotaBench

Workloads that are CPU and GPU heavy are one thing, but what about sustained productivity workloads? Penny Arcade's insanely talented artist Gabe mentioned that he noticed substantial lag when drawing on his Surface Pro 3. As Surface Pro 3's NTrig pen is actually lower latency than Surface Pro 2's, I wonder if what Gabe ran into might be thermal related rather than pen related (alternatively it could have to do with the much higher display resolution that Surface Pro 3 runs at). I needed a repeatable workload to see if non-gaming use cases also showed a regression. Thankfully PCMark 8 v2 provides a number of relatively long, repeatable workloads that are great for testing just this.

I decided to use the PCMark 8 v2 Work suite which includes web browsing, office producitivity (word processing and spreadsheet work) and video chat. It's a far cry from a Cinebench loop but I figured if I saw throttling here it would easily be present in heavier workloads. Each run of the suite actually repeats the tests three times and takes around 20 minutes to complete on the Surface devices. I measured performance for three suite runs (9 total runs of the Work loop) on both Surface Pro 2 and 3:

PCMark 8 v2 Work Performance Over Time
  Work Suite Run #1 Work Suite Run #2 Work Suite Run #3
Microsoft Surface Pro 3 (Core i5) 3273 3031 (92% of peak) 3129 (95% of peak)
Microsoft Surface Pro 2 (Core i5) 3222 3223 3218

As you can see, there's a 9% drop in performance on Surface Pro 3 from the first suite run to the next while Surface Pro 2 delivers consistent performance between runs. By the third run Surface Pro 3's performance recovers a bit, although it's still a few percent below the initial, cool run.

PCMark 8 v2 also plots CPU temperature and frequency throughout the course of the benchmark. Unfortunately I don't think I can export the data so I'm forced to present screenshots from the benchmark harness itself. Thankfully even looking at these screenshots is enough to tell us what's going on between the two Surface Pro devices:


Surface Pro 2 PCMark 8 v2 Work Suite


Surface Pro 3 PCMark 8 v2 Work Suite

Both devices do a good job of reaching max turbo fairly regularly, but Surface Pro 2 appears to spend more time at its max turbo state. By the last loop of the benchmark Surface Pro 3's max power is limited (green line) by the time we get to the video chat workload, I suspect that is where we lose a bit of performance. I'm only looking at the last of three suite runs here, if we looked at the second one the results would be even worse.

I haven't tested Surface Pro 3 in Microsoft's new docking station so I don't know if the device changes its thermal profile at all when docked, but in tablet or laptop mode it's definitely going to throttle quicker than Surface Pro 2 did. I must stress that I don't think this is necessarily a bad idea. Surface Pro 3 improves usability over Surface Pro 2 by leaps and bounds, and it's still faster under load than the original Surface Pro. All of this comes at the expense of reduced sustained performance. As you'll see from the rest of our tests, peak performance did get better over last year's model.

Fan Noise

Surface Pro 3's single fan is more noticeable than in Surface Pro 2. The sound it produces isn't necessarily louder, and when spinning it sounds a lot like any other Ultrabook with a fan in it. Compared to Surface Pro 2, the fan does kick in more frequently. For light tablet workloads or even light office work on Surface Pro 3, the fan remains silent. It's only when you're doing anything CPU or GPU intensive that you'll hear it spin up. The big difference is that in situations where you wouldn't hear fans spinning on Surface Pro 2, you'll sometimes hear it on SP3. This is the tradeoff that comes with the thinner chassis.

The New Type Cover & Pen Battery Life
Comments Locked

274 Comments

View All Comments

  • Drumsticks - Monday, June 23, 2014 - link

    Heavier isn't a bad thing. The SP pen is light in my opinion, I've yet to see somebody say the pen was uncomfortable. Again, on the pen, from above:

    "Overall I have to say the new pen is an improvement. You get a much thinner display, an unnoticeable impact to pressure sensitivity, improved latency and the experiential improvements are substantial (one click wake/OneNote launch is useful)."

    I'm not saying this as an artist. But as a user, note taking (which is my primary thing as a college student) seems to be improved.

    Upgrade the GPU to what? They weren't going to get dedicated graphics in here, and the i7 model uses an HD5000. The throttling is meh, as Anand again mentioned.

    I'm not saying it's perfect. I want this with Broadwell, badly. But it's a big improvement. It's not like it's going to be lacking for performance for what it's needed for. I don't expect mobile gaming out of this device, and that's the only time it really saw throttling.
  • ymcpa - Monday, June 23, 2014 - link

    Most reviews praised the pen for being heavier and more comfortable to hold than the original pen. Gabe from penny arcade never mentioned that the N-Trig was less accurate. I will take an artist opinion over yours. I have read several reviews where it was stated that the Wacom digitizer in the pro 2 wasn't very accurate near the edges.
  • mkozakewich - Monday, June 30, 2014 - link

    As someone who's suffering with the edges of my Pro, I'd like to hear about how the NTrig handles around the edges. As it's a different technology and doesn't rely on something behind the screen, I don't think there'll be any problems.
  • Imaginer - Tuesday, June 24, 2014 - link

    As an artist, and someone who regularly uses OneNote for notes on a previous Surface Pro 2, and after using a Surface Pro 3...

    With Wacom pens, they are hefty like a light pencil, and handle as such. The N-Trig pen of the Surface Pro 3, feels like a quality solid inking pen in weight.

    The time measured lag, is for long, prolonged strokes of quick length, instead of short and slow velocity movements. Most of the time, in writing, one isn't writing in big letters. The only time this MAY come up, is one doing quick, very long lines of strokes. Most of the time in drawing, such strokes are not too often and few in between in sketching and painting.

    My only gripe, is that with the default Pro 2's pen, is that the eraser end is pressure sensitive as well, and can be flipped to erase. The Pro 3's pen, uses a second button on the side for erasing, which isn't as comfortable for me, and an adjustment of such. Here is hoping a second more "pencil" like pen that has a pressure eraser end is released. (along with lack of Wacom's radial like menu).

    In the forums, I made an image showing handwriting aspects of pen comparisons - something that I felt would be a nice touch in the pen mentions - as it is not hard to pull a comparison between the two.
  • ymcpa - Monday, June 23, 2014 - link

    This was explained by Microsoft. A human can't possibly replicate 1028 points of pressure. They can't even replicate the 256 points that the N-trig has. It's really about the pressure curve that they use. Microsoft said they will have an update that will allow users to alter the pressure curve to fit their usage.
  • basroil - Tuesday, June 24, 2014 - link

    No, Microsoft explained that even though Wacom's pen supports 1024bit, the actual useful resolution is much smaller because of how the pressure curve is set up (it's linear rather than semi-exponential like the n-trig)
  • basroil - Tuesday, June 24, 2014 - link

    1024 levels (10bit), not 1024bit (which would be a ridiculous number)
  • ymcpa - Tuesday, June 24, 2014 - link

    They talked about the pressure curve but the blog also mentioned that the human hand can't reproduce 1024 different levels of pressure. That is why it won't matter if the N-trig digitizer has less levels of pressure.
  • theuglyman0war - Wednesday, June 25, 2014 - link

    Even if an artist can not replicate 1024 levels of pressure in a single stroke ( wonder how u prove as much as humans are scary amazing beings at times ) An artist can easily need as much "threshold" in many stokes... Working very light where details build up in whispers. In which case 256 levels unutilized is complete hogwash.
  • mkozakewich - Monday, June 30, 2014 - link

    It's not even about what the artist touches on themselves, and more about the range. I'd like my pen to to be able to step from size 1 to size 255 in 1-step increments, in which case the sensitivity levels would work fine and I'd still have a pen with a massive brush. (Honestly, I'd probably not use 255 levels. More like 64.)

Log in

Don't have an account? Sign up now