Computational species "bw filter simplified less" (CID=45741e3fbcf4024b:1db78910464c9d05)

Notes

Some cost-aware experiments (execution time, size, energy, compilation time) performed by Grigori Fursin. It supports our research on continuous performance tracking, code optimization and compiler benchmarking (regression detection).

This computation species (kernel) is a threshold filter - it is used in image processing and neuron activation functions (part of artificial neural networks).

Used artifacts

  • Systems:
    • S1 = Dell Laptop Latitude E6320, Processor=P1, Memory = 8Gb, Storage=256Gb (SSD), Max power consumption=52W, Cost (time of purchase)~1200 euros (CID=cb7e6b406491a11c:0d84339816de0271)
    • S2 = Samsung Mobile Galaxy Duos GT-S6312, Processor=P2, Memory = 0.8Gb, Storage=4Gb, Battery=1300 mAh / 3.9V / up to 250 hours, Max power consumption~5W, Cost (time of purchase)~200 euros (CID=cb7e6b406491a11c:a9740acbe06bcd1e)
    • S3 = Polaroid Tablet Executive 9" MID0927, Processor=P3, Memory=1Gb, Storage=16Gb, Battery=3500 mAh / 3.9V / up to 80 hours, Max power consumption~13W, Cost (time of purchase)~100 euros (CID=cb7e6b406491a11c:3419444faf22f3d0)
  • Compilers:
    • X1 = GCC 4.1.1
    • X2 = GCC 4.4.1
    • X3 = GCC 4.4.4
    • X4 = GCC 4.6.3
    • X5 = GCC 4.7.2
    • X6 = GCC 4.8.3
    • X7 = GCC 4.9.1
    • X8 = LLVM 3.1
    • X9 = LLVM 3.3.2
    • X10 = Open64 5.0
    • X11 = PathScale 2.3.1
    • X12 = NVidia CUDA Toolkit 5.0
    • X13 = Microsoft Visual Studio 2013, cost = has free minimal version
    • X14 = Intel Composer XE 2011, cost =
  • Compiler optimization level:
    • O1 = Performance (usually -O3)
    • O2 = Size (usually -Os)
  • Number of run-time code repetitions (for example, processing steps in neural networks):
    • R1 = 4000
    • R2 = 1000
    • R3 = 400
  • Total number of computations (processed neurons or pixels)
    • T1 ~ 9.6E9
    • T2 ~ 2.4E9
    • T3 ~ 1.0E9
  • Costs
    • C1= Execution time
    • C2 = Energy
    • C3 = Code size
    • C4 = Compilation time
    • C5 = System size
    • C6 = Hardware price
    • C7 = Software price
    • C8 = (Auto-)tuning price
    • C9 = Development time
    • C10 = Validation and testing time

Notes

Energy: 1Wh = 3600 joules

W = mAh * V / 1000 = 1300 * 3.9 / 1000 ~ 5W


(C) 2011-2014 cTuning foundation