Benchmarking the PetaCache
Methodology
SCCS has set up a farm in IR2 for us to use while benchmarking the Petacache. As this farm is shared with BaBar simulation generation, we need to schedule the usage for serious tests. For tool development, there should be no conflict.
The farm consists of 40 nodes of dual processor, dual core AMD Opteron(tm) 275 nodes running RHEL 5.5. (AMD nominal speed = 4x2200MHz). (Note: These systems seem to be using dynamic power management to reduce their current cpu speed, so /proc/cpuinfo may show values as low as 1000 MHz.) The node names are ir2farm{01,02...40
}. Home directories are AFS, and SLAC credentials transfer.
Benchmarking Tools
FileSystem Benchmarking Projects
- Project FSBench (SUNY-SB)
- Parallel I/O Benchmarking Consortium (ANL)
Discussions of Flash
- Sun: Flash/ZFS
- Usenix FAST-10 Proceedings
- DFS: A FileSystem for Virtualized Storage (Princeton/Fusion-io)
Discussion of filesystem design using commercial flash drives (PCI-EX based). Design is an actual minimal filesystem, managed by the kernel. Loadable kernel module required – thus, superuser privs and kernel access needed. These filesystems might possibly be able to be exported via NFS.
Benchmarks discussed include:- Micro benchmarks (write/read rate using Iozone (q.v.))
Problems with their method: only two threads pounding on it. - Application benchmarks: Most not useful for us. 3/5 are either mem-mapped IO or database like.
- Micro benchmarks (write/read rate using Iozone (q.v.))
- Extending SSD Lifetimes with Disk-Based Write Caches (Slides) (Toronto/Microsoft)
- Write Endurance in Flash Drives: Measurements and Analysis (Slides) (Northeastern)
- DFS: A FileSystem for Virtualized Storage (Princeton/Fusion-io)
Standalone Papers
- Benchmarking Parallel I/O Performance For a Large Scale Scientific Application on the Teragrid
- File System Workload Analysis For Large Scale Scientific Computing Applications
- PostMark: A New File System Benchmark