| Field | Value |
|---|---|
| DOI | 10.1109/DSD.2019.00105 |
| Year | 2019 |
| Type | Proceedings paper |
Clustering is an essential operation in many data analysis applications. In particular, bioinformatics and genome analysis use clustering to group similar components in sequence data, in order to find important patterns such as DNA motifs. In this paper, we present an algorithm that clusters DNA data using locality-sensitive hashing with MinHash to group similar subsequences in large ChIP-seq datasets. Tested on a standard mESC dataset, the algorithm builds clusters that contain subsequences with high-score matches to known DNA motifs. We also describe the architecture and implementation of a hardware accelerator on a Xilinx Kintex-7 XC7K325T FPGA that exploits the parallelism of the algorithm to cluster data with a throughput of one k-mer per clock cycle at 350 MHz. The accelerator achieves a speedup of 91x over a parallel software implementation of the algorithm on a 24-core server.
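The general idea of grouping similar k-mers with MinHash and locality-sensitive hashing can be illustrated with a minimal software sketch. This is not the authors' algorithm or accelerator design: the shingle length `s`, the banding parameters `bands` and `rows`, the BLAKE2b-based hash, and all function names are assumptions chosen only for illustration. Each k-mer is reduced to a MinHash signature over its shingles, the signature is split into bands, and k-mers whose signatures agree on at least one band land in the same bucket, which serves as a candidate cluster.

```python
import hashlib
from collections import defaultdict


def kmers(seq, k):
    """Yield (position, k-mer) for every overlapping k-mer of a sequence."""
    for i in range(len(seq) - k + 1):
        yield i, seq[i:i + k]


def shingles(kmer, s):
    """Return the set of overlapping length-s shingles of a k-mer."""
    return {kmer[i:i + s] for i in range(len(kmer) - s + 1)}


def hash_shingle(shingle, seed):
    """Deterministic 64-bit hash of a shingle under a given seed (illustrative choice)."""
    data = f"{seed}:{shingle}".encode()
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(kmer, s, num_hashes):
    """MinHash signature: for each seed, the minimum hash over the shingle set."""
    sh = shingles(kmer, s)
    return tuple(min(hash_shingle(x, seed) for x in sh) for seed in range(num_hashes))


def lsh_cluster(sequences, k=8, s=3, bands=4, rows=2):
    """Bucket k-mers by LSH bands of their MinHash signatures.

    Returns a dict mapping (band index, band values) to the set of
    (sequence index, position) pairs whose signatures agree on that band.
    """
    if s > k:
        raise ValueError("shingle length s must not exceed k")
    buckets = defaultdict(set)
    for seq_idx, seq in enumerate(sequences):
        for pos, kmer in kmers(seq, k):
            sig = minhash_signature(kmer, s, bands * rows)
            for b in range(bands):
                band = sig[b * rows:(b + 1) * rows]
                buckets[(b, band)].add((seq_idx, pos))
    return buckets


if __name__ == "__main__":
    seqs = ["ACGTACGTAC", "TTACGTACGT"]
    for key, members in lsh_cluster(seqs).items():
        if len(members) > 1:
            print(key[0], sorted(members))
```

In this sketch, more rows per band make a bucket collision require closer similarity, while more bands raise the chance that moderately similar k-mers share at least one bucket. Identical k-mers always collide in every band.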
| No. | Author | Gender | Institution - Country |
|---|---|---|---|
| 1 | Soto, Javier E. | Male | Universidad de Concepción - Chile |
| 2 | GODOY-MEDEL, SEBASTIAN EUGENIO | Male | Universidad de Concepción - Chile |
| 3 | HERNANDEZ-RIVAS, CECILIA PAOLA | Female | Universidad de Concepción - Chile; Ctr Biotechnol & Bioengn CeBiB - Chile |
| 4 | FIGUEROA-YEVENES, MAXIMILIANO | Male | Universidad de Concepción - Chile |
| 5 | Konofaos, N | - | |
| 6 | Kitsos, P | - | |
| Funding source |
|---|
| CONICYT-PFCHA |
| Fondo Nacional de Desarrollo Científico y Tecnológico |
| Comisión Nacional de Investigación Científica y Tecnológica |
| Basal Funds |
| CONICYT, Chile, through Fondecyt |
| Acknowledgment |
|---|
| This work was supported by CONICYT, Chile, through FONDECYT grant 1180995 and Basal Funds FB0001. J. E. Soto acknowledges the support of CONICYT-PFCHA graduate scholarship folio 21161631. |