

TouchXPRT CP1

This week we released the TouchXPRT 2014 Community Preview 1 (CP1). As with past community previews, the tests are stable and you may publish your results.

CP1 has a number of improvements over TouchXPRT 2013. We’ve updated the tests, which now use new, more demanding data. The Run All button is now prominent on the main screen, and the benchmark includes a results viewer.

However, as I said last week, the new UI design did not make it into CP1. Over the next few weeks, we’ll be working to give TouchXPRT an exciting new look. The results viewer will also change a lot. The current version captures the date, time, and test results, but the sandbox environment of Windows 8 applications makes gathering system information challenging. We’re working to solve that problem. We’ll also be streamlining the results submission process.

Rest assured that, while the appearance will change, the results will not. The test results you generate with CP1 will be good for the life of the benchmark.

Community previews are only available to community members. If you are not a member, this is a great time to join.

After you’ve downloaded CP1, let us know what you think by posting to the forum or e-mailing us at BenchmarkXPRTsupport@principledtechnologies.com.

Eric


It’s almost here

Sometime next week, we plan to release a sneak preview of TouchXPRT 2014, the TouchXPRT 2014 Community Preview 1 (CP1).

CP1, as its name makes clear, is not the final TouchXPRT 2014 release. There is still a lot of work to do on the user interface and the new results viewer.  However, it includes a number of improvements over the current TouchXPRT, making it an even more useful tool for measuring Windows 8 and Windows 8.1 device performance. It is also a great way for everyone in the community to see the current state of our thinking and to provide us with feedback. You can run this version of the tool and see what you think!

As we have done with previous community previews, we’re also taking two more steps:

  • We’re not putting any publication restrictions on this preview release. Test at will, and publish your findings.
  • We’re releasing the source code to all community members. If you’re curious about not just what we’re doing but how we’re doing it, you can find out.

We believe these steps make the tool easier to evaluate and more useful to all of us.

Releasing a preview version is a lot of work, because we have to do much of the work of a full software release on less-than-final code, but we believe the value to our community justifies the effort.

Next week, when we release CP1, I’ll go over more details, the known limitations, and how you can get us your feedback—feedback we very much want.

Between now and then, we’ll be readying CP1 for your use.

Eric


Anatomy of a benchmark, part II

As we discussed last week, benchmarks (including HDXPRT 2011) are made up of a set of common major components. Last week’s components included the Installer, User Interface (UI), and Results Viewer.  This week, we’ll look more at the guts of a benchmark—the parts that actually do the performance testing.

Once the UI gets the necessary commands and parameters from the user, the Test Harness takes over.  This part is the logic that runs the individual Tests or Workloads using the parameters you specified.  For application-based benchmarks, the harness is particularly critical, because it has to deal with running real applications.  (Simpler benchmarks may mix the harness and test code in a single program.)

The next component consists of the Tests or Workloads themselves.  Some folks use those terms interchangeably, but I try to avoid that practice.  I tend to think of tests as specially crafted code designed to gauge some aspect of a system’s performance, while workloads consist of a set of actions that an application must take as well as the necessary data for those actions.  In HDXPRT 2011, each workload is a set of data (such as photos) and actions (e.g., manipulations of those photos) that an application (e.g., Photoshop Elements) performs.  Application-based benchmarks, such as HDXPRT 2011, typically use some other program or technology to pass commands to the applications.  HDXPRT uses a combination of AutoIT and C code to drive the applications.
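To make the harness/workload split concrete, here’s a minimal sketch of a harness that treats each workload as a bundle of data plus actions and times each one. All of the names here (Workload, runHarness, photoEditWorkload) are hypothetical and the code is not HDXPRT source; HDXPRT drives real applications through AutoIT and C rather than calling functions like this.

```typescript
// A minimal harness sketch: each workload is data plus actions,
// and the harness times how long the actions take over that data.
// All names are hypothetical; this is not HDXPRT code.

interface Workload {
  name: string;
  data: string[];                            // e.g., the photos a workload manipulates
  actions: (item: string) => Promise<void>;  // the scripted steps performed on each item
}

async function runHarness(workloads: Workload[]): Promise<Map<string, number>> {
  const results = new Map<string, number>();
  for (const workload of workloads) {
    const start = Date.now();
    for (const item of workload.data) {
      await workload.actions(item);          // a real harness would launch and script an application here
    }
    results.set(workload.name, (Date.now() - start) / 1000); // elapsed seconds
  }
  return results;
}

// Example: a toy "photo editing" workload
const photoEditWorkload: Workload = {
  name: 'Photo editing',
  data: ['photo1.jpg', 'photo2.jpg'],
  actions: async (photo) => console.log(`Pretending to resize and sharpen ${photo}`),
};

runHarness([photoEditWorkload]).then((timings) => console.log(timings));
```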

When the Harness finishes running the tests or workloads, it collects the results.  It then either passes those results to the Results Viewer or writes them to a file for viewing in Excel or some other program.
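As a small illustration of that last step, a harness might dump its collected timings to a CSV file that Excel or a simple results viewer can open. Again, the function and format below are hypothetical, not HDXPRT code:

```typescript
// Illustrative only: write the harness's collected timings to a CSV file
// that Excel or a simple results viewer can open.
import { writeFileSync } from 'node:fs';

function writeResultsCsv(results: Map<string, number>, path: string): void {
  const header = 'workload,seconds';
  const rows = [...results.entries()].map(([name, seconds]) => `${name},${seconds.toFixed(2)}`);
  writeFileSync(path, [header, ...rows].join('\n') + '\n');
}
```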

As we look to improve HDXPRT for next year, what improvements would you like to see in each of those areas?

Bill


Anatomy of a benchmark, part I

Over many years of dealing with benchmarks, I’ve found that there are a few major components that HDXPRT 2011 and most others include.  Some of these components are not what you might think of as part of a benchmark, but they are essential to making one both easy to use and capable of producing reproducible results.  We’ll look at those parts this week and the rest next week.

The first piece that you encounter when you use a benchmark is its Installation program.  Simple benchmarks may forgo an installation component and just let you copy the files, including any executables, into a directory.  By contrast, HDXPRT 2011, like other application-based benchmarks, takes great pains to install the necessary applications. It even has to check to see which of them are already installed on the computer under test and cope with those it finds.
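To give a flavor of that check, here is a hedged sketch of the “is it already installed?” test an installer might perform. The install paths are placeholders, and a real installer would also consult the Windows registry and verify application versions:

```typescript
// Hypothetical sketch of the "already installed?" check.
// The install paths are placeholders; a real installer would also consult the
// Windows registry and verify application versions.
import { existsSync } from 'node:fs';

const requiredApps: Record<string, string> = {
  'Photoshop Elements': 'C:\\Program Files\\Adobe\\Photoshop Elements\\PhotoshopElements.exe',
  'Example video editor': 'C:\\Program Files\\ExampleVendor\\VideoEditor\\VideoEditor.exe',
};

for (const [app, exePath] of Object.entries(requiredApps)) {
  if (existsSync(exePath)) {
    console.log(`${app} is already installed; skipping it.`);
  } else {
    console.log(`${app} not found; installing it now.`);
  }
}
```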

Once the benchmark is on the system, you launch it and encounter the User Interface (UI).  For some benchmarks, the UI may be only a command-line interface with a set of switches or options. HDXPRT 2011, in keeping with its emphasis on an HD user experience, includes a graphical UI that lets you run its tests.
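For comparison, the simplest kind of UI, a command line with a couple of switches, might look something like the sketch below. The switch names are invented for illustration:

```typescript
// A sketch of the simplest benchmark UI: a command line with a few switches.
// The switch names are invented for illustration.
const args = process.argv.slice(2);
const options = { iterations: 3, workload: 'all' };

for (let i = 0; i < args.length; i++) {
  if (args[i] === '--iterations') options.iterations = Number(args[++i]);
  else if (args[i] === '--workload') options.workload = args[++i];
}

console.log(`Running workload "${options.workload}" for ${options.iterations} iteration(s).`);
```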

Many benchmarks, including HDXPRT 2011, provide a Results Viewer that makes it easy for you to look at your results and compare them to others.  Results viewers range from fairly simple to quite sophisticated.  The prevalence of spreadsheet applications and XML has led benchmark creators to minimize the development cost of this component.
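At the simple end of that range, a bare-bones viewer might do little more than load a CSV of workload timings and compare it to a baseline file, along these lines (the file names and format are illustrative, not from any XPRT benchmark):

```typescript
// A bare-bones "results viewer": load a simple CSV of workload timings and
// compare it to a baseline file. File names and format are illustrative.
import { readFileSync } from 'node:fs';

function loadResults(path: string): Map<string, number> {
  const results = new Map<string, number>();
  for (const line of readFileSync(path, 'utf8').trim().split('\n').slice(1)) {
    const [name, seconds] = line.split(',');
    results.set(name, Number(seconds));
  }
  return results;
}

const baseline = loadResults('baseline.csv');
const current = loadResults('results.csv');
for (const [name, seconds] of current) {
  const base = baseline.get(name);
  if (base !== undefined) {
    console.log(`${name}: ${seconds.toFixed(2)}s vs. baseline ${base.toFixed(2)}s (${(base / seconds).toFixed(2)}x)`);
  }
}
```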

Next week, I’ll look at the components that handle the actual tests that make up the benchmark.

Bill


Putting together a good WebXPRT workload proposal

Recently, we announced that we’re moving forward with the development of a new AI-focused WebXPRT 4 workload. It will be an auxiliary workload, which means that it will run as a separate, optional test, and it won’t affect existing WebXPRT 4 tests or scores. Although the inspiration for this new workload came from internal WebXPRT discussions (and, let’s face it, from the huge increase in the importance of AI), we wanted to remind you that we’re always open to hearing your WebXPRT workload ideas. If you’d like to submit proposals for new workloads, you don’t have to follow a formal process. Just contact us, and we’ll start the conversation.

If you do decide to send us a workload proposal, it will be helpful to know the types of parameters that we keep in mind. Below, we discuss some of the key questions we ask when we evaluate new WebXPRT workload ideas.

Will it be relevant and interesting to real users, lab testers, and tech reviewers?

When considering a WebXPRT workload proposal, the first two criteria are simple: is it relevant in real life, and are people interested in the workload? We created WebXPRT to evaluate device performance using web-based tasks that consumers are likely to experience daily, so real-life relevance has always been an essential requirement for us throughout development. There are many technologies, functions, and use cases that we could test in a web environment, but only some are relevant to common applications or usage patterns and are likely to draw the interest of real users, lab testers, and technical reviewers.

Will it have cross-platform support?

Currently, WebXPRT runs on almost any web browser and almost every device that supports a web browser. We would like to keep that level of cross-platform support when we introduce new workloads. However, technical differences in how various browsers execute tasks make it challenging to include certain scenarios without undermining our cross-platform ideal. When considering any workload proposal, one of the first questions we ask is, “Will it work on all the major browsers and operating systems?”

There are special exceptions to this guideline. For instance, we’re still in the early days of browser-based AI, and it’s unlikely that a new browser-based AI workload will run on every major browser. If it’s a particularly compelling idea, such as the AI scenario we’re currently working on, we may consider including it as an auxiliary test.

Will it differentiate performance between different types of devices?

XPRT benchmarks provide users with accurate measures for evaluating how well target systems or technologies perform specific tasks. With a broadly targeted benchmark like WebXPRT, if the workloads are so heavy that most devices can’t handle them or so light that most devices complete them without being taxed, the results will be of little use to buyers evaluating systems and making purchasing decisions, to OEM labs, or to the tech press.

That’s why, with any new WebXPRT workload, we look for a sweet spot with respect to how computationally demanding it will be. We want it to run on a wide range of devices—from low-end devices that are several years old to brand-new high-end devices, and everything in between. We also want users to see a wide range of workload scores and resulting overall scores that accurately reflect the experiences those systems deliver, so they can easily grasp the different performance capabilities of the devices under test.

Will results be consistent and easily replicated?

Finally, WebXPRT workloads should produce scores that consistently fall within an acceptable margin of error and are easily replicated with additional testing or comparable gear. Some web technologies are very sensitive to uncontrollable or unpredictable variables, such as internet speed. A workload that measures one of those technologies would be unlikely to produce results that are consistent and easily replicated.
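One common way to express that kind of run-to-run consistency is the coefficient of variation (CV) across repeated runs. The sketch below shows the idea; the scores and the 3% threshold are hypothetical, not WebXPRT criteria:

```typescript
// One way to quantify run-to-run consistency: the coefficient of variation (CV)
// across repeated runs. The scores and the 3% threshold are hypothetical.
function coefficientOfVariation(scores: number[]): number {
  const mean = scores.reduce((sum, s) => sum + s, 0) / scores.length;
  const variance = scores.reduce((sum, s) => sum + (s - mean) ** 2, 0) / scores.length;
  return Math.sqrt(variance) / mean;
}

const repeatedRuns = [212, 215, 209, 214, 211]; // five hypothetical overall scores
const cv = coefficientOfVariation(repeatedRuns);
console.log(`CV = ${(cv * 100).toFixed(1)}% ${cv < 0.03 ? '(acceptably consistent)' : '(too noisy)'}`);
```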

We hope this post will be useful if you’re thinking about potential new workloads that you’d like to see in WebXPRT. If you have any general thoughts about browser performance testing or specific workload ideas that you’d like us to consider, please let us know.

Justin

Making progress with WebXPRT 4 in iOS 17

In recent blog posts, we discussed an issue that we encountered when attempting to run WebXPRT 4 on iOS 17 devices. If you missed those posts, you can find more details about the nature of the problem here. In short, the issue is that the Encrypt Notes and OCR scan subtest in WebXPRT 4 gets stuck when the Tesseract.js Optical Character Recognition (OCR) engine attempts to scan a shopping receipt. We’ve verified that the issue occurs on devices running iOS 17, iPadOS 17, and macOS Sonoma with Safari 17.
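For readers unfamiliar with Tesseract.js, the OCR step in that subtest boils down to a call along these lines. This is only an illustration, not WebXPRT source code, and the input file name is a placeholder:

```typescript
// Illustration only, not WebXPRT source code; 'receipt.png' is a placeholder.
import Tesseract from 'tesseract.js';

Tesseract.recognize('receipt.png', 'eng')   // scan an image of a shopping receipt
  .then(({ data }) => {
    console.log(data.text);                 // the recognized receipt text
  })
  .catch((err) => console.error('OCR failed:', err));
```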

After a good bit of troubleshooting and research to identify the cause of the problem, we decided to build an updated version of WebXPRT 4 that uses a newer version of Tesseract for the OCR task. Aside from updating Tesseract in the new build, we aimed to change as little as possible. To maximize continuity, we’re still using the original input image for the receipt-scanning task, and we decided to stick with the WASM library instead of a WASM-SIMD library. Other than the new version of tesseract.js, WebXPRT 4 version number updates, and updated documentation where necessary, all aspects of WebXPRT 4 will remain the same.

We’re currently testing a candidate build of this new version on a wide array of devices. The results so far seem promising, but we want to complete our due diligence and make sure this is the best approach to solving the problem. We know that OEM labs and tech reviewers put a lot of time and effort into compiling databases of results, so we hope to provide a solution that minimizes results disruption and inconvenience for WebXPRT 4 users. Ideally, folks would be able to integrate scores from the new build without any questions or confusion about comparability.

We don’t yet have an exact release date for a new WebXPRT 4 build, but we can say that we’re shooting for the end of October. We appreciate everyone’s patience as we work towards the best possible solution. If you have any questions or concerns about an updated version of WebXPRT 4, please let us know.

Justin
