GWA-T-12 Bitbrains

Trace Description

The dataset contains the performance metrics of 1,750 VMs from a distributed datacenter from Bitbrains, which is a service provider that specializes in managed hosting and business computation for enterprises. Customers include many major banks (ING), credit card operators (ICS), insurers (Aegon), etc. Bitbrains hosts applications used in the solvency domain; examples of application vendors are Towers Watson and Algorithmics. These applications are typically used for financial reporting, which is used predominately at the end of financial quarters.

Each file contains the performance metrics of a VM. These files are organized according by traces: fastStorage and Rnd.

The first trace, fastStorage, consists of 1,250 VMs that are connected to fast storage area network (SAN) storage devices. The second trace, Rnd, consists of 500 VMs that are either connected to the fast SAN devices or to much slower Network Attached Storage (NAS) devices. The fastStorage trace includes a higher fraction of application servers and compute nodes than the Rnd trace, which is due to the higher performance of the storage attached to the fastStorage machines. Conversely, for the Rnd trace we observe a higher fraction of management machines, which only require storage with lower performance and less frequent access.

In the Rnd directory, the files are organized into 3 sub-directories by the month that the metrics are recorded.

The format of each file is row-based, each row represent an observation of the performance metrics. Each column of a row is separate by “;\t” The format of each row is

  1. Timestamp: number of milliseconds since 1970-01-01.
  2. CPU cores: number of virtual CPU cores provisioned.
  3. CPU capacity provisioned (CPU requested): the capacity of the CPUs in terms of MHZ, it equals to number of cores x speed per core.
  4. CPU usage: in terms of MHZ.
  5. CPU usage: in terms of percentage
  6. Memory provisioned (memory requested): the capacity of the memory of the VM in terms of KB.
  7. Memory usage: the memory that is actively used in terms of KB.
  8. Disk read throughput: in terms of KB/s
  9. Disk write throughput: in terms of KB/s
  10. Network received throughput: in terms of KB/s
  11. Network transmitted throughput: in terms of KB/s

Contributors

Trace sharing:
Bitbrains

Trace analysis:
Siqi Shen, Vincent van Beek, Alexandru Iosup

Trace Download

Copyright note
This trace was graciously provided by Bitbrains IT Services Inc. To use this traces, you must include an acknowledgement to the source of the data in any published material that refers to the data. Please refer to the CCGrid 2015 paper, and please also consider referring to the Grid Workloads Archive in the acknowledgements.

Download the GWA-T-12  fastStorage trace [128MB]
Download the GWA-T-12 rnd trace [156MB]

Analysis report

A detailed analysis report of this GWA-T-12 trace is available online.

Side content