About me
I am a final-year undergraduate student at Dian Nuswantoro University. My research interests are in systems and networking, with an emphasis on building adaptive, high-performance runtime systems for cloud platforms. I explore how telemetry signals, approximate computation (e.g., sketches, sampling), and machine learning-based optimization can be used to improve scalability, reduce operational costs, and enhance responsiveness under real-world workloads.
News
- [09/2025] PromSketch-Standalone will be released as an open-source project.
- [09/2025] Analyzing the Impact of Transpose Layers on CNN-Based Deep Learning will be presented at iSEMANTIC’25
- [05/2025] I will be collaborating with Prof. Alan Zaoxing Liu this summer.
- [07/2025] Analyzing the Impact of Transpose Layers on CNN-Based Deep Learning was accepted by iSEMANTIC'25!
- [01/2025] I will be joining the UChicago-Indonesia System and AI Research Training Program this winter!
- [08/2024] I will be a Research Assistant at Dinus Research Center.
- [08/2024] I will be a Teaching Assistant for Machine Learning and Data Science courses at Dian Nuswantoro University!
- [08/2024] I will be interning at Dinus AI Research and Development Group this fall!
Publications
[iSEMANTIC'25] Analyzing the Impact of Transpose Layers on CNN-Based Deep Learning
Sie, Deta Dirganjaya, Guruh Fajar Shidik, Maulidya Ayu Ardiena, Radhitya Marendratama, and Edi Jaya Kusuma
to appear in iSEMANTIC'25
Paper Summary
This study evaluates the impact of integrating transpose convolution (ConvTranspose2d) layers into two modern YOLO architectures, YOLOv5m and YOLOv8m, for real-time pothole detection in complex road environments. The research replaces standard upsampling layers with learnable transpose layers in the detection head to better recover fine spatial details, which is important for identifying small, irregular, and partially obscured potholes. Experimental results show performance improvements in both models: YOLOv5m with transpose layers achieved a 1.57% increase in mAP@50, while YOLOv8m achieved a 3.46% improvement, though with higher inference latency. YOLOv5m provided the best balance between accuracy and real-time operation, while YOLOv8m benefited from stronger recall gains. These findings suggest that learnable upsampling can enhance detection sensitivity, particularly in visually complex scenarios, and can be applied to other real-time detection tasks that require detailed feature reconstruction.
Experience
Research Intern
FROOT Lab, University of Maryland, College Park, United States
May. 2025 - Present
Fortunate to collaborate with Prof. Alan Zaoxing Liu
UChicago-Indonesia Research Training Program
University of Chicago, Illinois, United States
Jan. 2025 - Jun. 2025
Fortunate to be mentored by Prof. Haryadi S. Gunawi
Research Assistant
Dinus Research Center, Dian Nuswantoro University, Indonesia
Aug. 2024 - Jan. 2025
Fortunate to be advised by Prof. Dr. Guruh Fajar Shidik
Research Intern
Dinus AI Research and Development Group, Dian Nuswantoro University, Indonesia
Aug. 2024 - Jan. 2025
Fortunate to be mentored by Ardytha Luthfiarta and Adhitya Nugraha
Teaching
Open-Source Project
PromSketch-Standalone
Description
PromSketch-Standalone is a fully autonomous cloud telemetry runtime that supports high-throughput ingestion, sketch-based storage, PromQL-compatible querying, and visualization without relying on traditional monitoring systems. It features a parallel, event-driven ingestion pipeline capable of processing millions of time-series samples per second with significantly reduced latency and resource overhead. The system integrates a sketch-aware query and storage engine that performs real-time analytics directly over compressed metric summaries, providing high statistical accuracy while drastically reducing memory footprint.
VSA: Visual Smart Assistant
Description
Virtual Smart Assistant (VSA) is a personalized companion system designed to support students at Dian Nuswantoro University in both campus services and emotional well-being. It detects student emotions using facial recognition and a custom lightweight CNN for edge-based emotion tracking, and responds empathetically through an LLM-powered conversational engine trained on an Indonesian student-dialogue corpus. VSA features real-time noise suppression for clear voice interaction and is deployed on NVIDIA Jetson and Raspberry Pi devices, enabling adaptive, low-latency inference under limited compute and bandwidth conditions. The system aims to provide an accessible, responsive, and supportive virtual presence for students facing academic or personal challenges.
CLCM: Custom Lightweight CNN Model for low-end edge devices
Description
Low-end edge devices such as the Jetson Nano are constrained by limited computing power, memory, and graphics processing capacity, posing challenges for running complex models efficiently. To overcome these limitations, lightweight and fast models are essential to ensure real-time performance without lag, enabling timely and accurate emotional responses for users. In this research, we introduce the Custom Lightweight CNN Model (CLCM), a CNN-based face recognition model optimized for speed and efficiency while maintaining high accuracy. The results demonstrate that CLCM outperforms well-known pre-trained models, achieving up to 3× faster inference speed, with an average real-time detection time of 13 ms. Additionally, the model maximizes GPU utilization on the Jetson Nano, reaching 90% efficiency through CUDA integration, further enhancing performance without compromising device limitations.
Awards
- International Research Scholarship, Directorate General of Higher Education Indonesia | 2025
- Top 50 CS students in Indonesia. Part of Indonesia's academic research development program.
- Flagship Program Student of Computer Science, Dian Nuswantoro University | 2023, 2024, 2025
- Top 50 of 1,855 CS students (Batch 2022). Part of the university’s flagship acceleration program.