b'Hyesoon Kim'

research

∙ 08/05/2023

RV-CURE: A RISC-V Capability Architecture for Full Memory Safety

Despite decades of efforts to resolve, memory safety violations are stil...

0 Yonghae Kim, et al. ∙

research

∙ 02/17/2023

VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs

Deep Learning (DL) acceleration support in CPUs has recently gained a lo...

0 Geonhwa Jeong, et al. ∙

research

∙ 02/01/2023

Revisiting Query Performance in GPU Database Systems

GPUs offer massive compute parallelism and high-bandwidth memory accesse...

0 Jiashen Cao, et al. ∙

research

∙ 06/16/2022

CuPBoP: CUDA for Parallelized and Broad-range Processors

CUDA is one of the most popular choices for GPU programming, but it can ...

0 Ruobing Han, et al. ∙

research

∙ 12/19/2021

COX: CUDA on X86 by Exposing Warp-Level Functions to CPUs

As CUDA programs become the de facto program among data parallel applica...

0 Ruobing Han, et al. ∙

research

∙ 10/21/2021

Vortex: Extending the RISC-V ISA for GPGPU and 3D-GraphicsResearch

The importance of open-source hardware and software has been increasing....

0 Blaise Tine, et al. ∙

research

∙ 10/05/2021

RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU

As AI-based applications become pervasive, CPU vendors are starting to i...

0 Geonhwa Jeong, et al. ∙

research

∙ 09/02/2021

Supporting CUDA for an extended RISC-V GPU architecture

With the rapid development of scientific computation, more and more rese...

0 Ruobing Han, et al. ∙

research

∙ 04/09/2021

Context-Aware Task Handling in Resource-Constrained Robots with Virtualization

Intelligent mobile robots are critical in several scenarios. However, as...

0 Ramyad Hadidi, et al. ∙

research

∙ 04/09/2021

Creating Robust Deep Neural Networks With Coded Distributed Computing for IoT Systems

The increasing interest in serverless computation and ubiquitous wireles...

0 Ramyad Hadidi, et al. ∙

research

∙ 02/16/2021

THIA: Accelerating Video Analytics using Early Inference and Fine-Grained Query Planning

To efficiently process visual data at scale, researchers have proposed t...

0 Jiashen Cao, et al. ∙

research

∙ 11/22/2020

Copernicus: Characterizing the Performance Implications of Compression Formats Used in Sparse Workloads

Sparse matrices are the key ingredients of several application domains, ...

0 Bahar Asgari, et al. ∙

research

∙ 11/17/2020

Secure Location-Aware Authentication and Communication for Intelligent Transportation Systems

Intelligent transportation systems (ITS) are expected to effectively cre...

0 Nima Shoghi Ghalehshahi, et al. ∙

research

∙ 11/13/2020

Reducing Inference Latency with Concurrent Architectures for Image Recognition

Satisfying the high computation demand of modern deep learning architect...

0 Ramyad Hadidi, et al. ∙

research

∙ 03/13/2020

Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution

The rise of deep neural networks (DNNs) is inspiring new studies in myri...

0 Ramyad Hadidi, et al. ∙

research

∙ 02/27/2020

Vortex: OpenCL Compatible RISC-V GPGPU

The current challenges in technology scaling are pushing the semiconduct...

0 Fares Elsabbagh, et al. ∙

research

∙ 05/18/2019

A Case Study: Exploiting Neural Machine Translation to Translate CUDA to OpenCL

The sequence-to-sequence (seq2seq) model for neural machine translation ...

0 Yonghae Kim, et al. ∙

research

∙ 01/08/2019

Collaborative Execution of Deep Neural Networks on Internet of Things Devices

With recent advancements in deep neural networks (DNNs), we are able to ...

0 Ramyad Hadidi, et al. ∙

research

∙ 02/05/2018

Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices

The prevalence of Internet of things (IoT) devices and abundance of sens...

0 Ramyad Hadidi, et al. ∙

research

∙ 10/30/2017

Louvre: Lightweight Ordering Using Versioning for Release Consistency

Fence instructions are fundamental primitives that ensure consistency in...

0 Pranith Kumar, et al. ∙

research

∙ 10/26/2017

CODA: Enabling Co-location of Computation and Data for Near-Data Processing

Recent studies have demonstrated that near-data processing (NDP) is an e...

0 Hyojong Kim, et al. ∙

research

∙ 07/17/2017

Performance Implications of NoCs on 3D-Stacked Memories: Insights from the Hybrid Memory Cube

Memories that exploit three-dimensional (3D)-stacking technology, which ...

0 Ramyad Hadidi, et al. ∙

research

∙ 06/08/2017

Demystifying the Characteristics of 3D-Stacked Memories: A Case Study for Hybrid Memory Cube

Three-dimensional (3D)-stacking technology, which enables the integratio...

0 Ramyad Hadidi, et al. ∙

Hyesoon Kim

Featured Co-authors

Sign in with Google

Consider DeepAI Pro