Please use this identifier to cite or link to this item: https://doi.org/10.1145/2503210.2503234
DC FieldValue
dc.titleAccelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes
dc.contributor.authorTang, W.T.
dc.contributor.authorTan, W.J.
dc.contributor.authorRay, R.
dc.contributor.authorWong, Y.W.
dc.contributor.authorChen, W.
dc.contributor.authorKuo, S.-H.
dc.contributor.authorGoh, R.S.M.
dc.contributor.authorTurner, S.J.
dc.contributor.authorWong, W.-F.
dc.date.accessioned2014-07-04T03:11:13Z
dc.date.available2014-07-04T03:11:13Z
dc.date.issued2013
dc.identifier.citationTang, W.T., Tan, W.J., Ray, R., Wong, Y.W., Chen, W., Kuo, S.-H., Goh, R.S.M., Turner, S.J., Wong, W.-F. (2013). Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes. International Conference for High Performance Computing, Networking, Storage and Analysis, SC : -. ScholarBank@NUS Repository. https://doi.org/10.1145/2503210.2503234
dc.identifier.isbn9781450323789
dc.identifier.issn21674337
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/77996
dc.description.abstractThe sparse matrix-vector (SpMV) multiplication routine is an important building block used in many iterative algo-rithms for solving scientific and engineering problems. One of the main challenges of SpMV is its memory-boundedness. Although compression has been proposed previously to im-prove SpMV performance on CPUs, its use has not been demonstrated on the GPU because of the serial nature of many compression and decompression schemes. In this pa-per, we introduce a family of bit-representation-optimized (BRO) compression schemes for representing sparse matrices on GPUs. The proposed schemes, BRO-ELL, BRO-COO, and BRO-HYB, perform compression on index data and help to speed up SpMV on GPUs through reduction of memory trafic. Furthermore, we formulate a BRO-aware matrix re-ordering scheme as a data clustering problem and use it to increase compression ratios. With the proposed schemes, ex-periments show that average speedups of 1.5 compared to ELLPACK and HYB can be achieved for SpMV on GPUs. Copyright 2013 ACM.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/2503210.2503234
dc.sourceScopus
dc.subjectData compression
dc.subjectGPU
dc.subjectMatrix-vector multiplication
dc.subjectMemory bandwidth
dc.subjectParallelism
dc.subjectSparse matrix format
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1145/2503210.2503234
dc.description.sourcetitleInternational Conference for High Performance Computing, Networking, Storage and Analysis, SC
dc.description.page-
dc.identifier.isiut000345856900027
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.