Sultan Mahmud Sajal

Performance Engineer at NetApp · Systems Researcher · Ph.D. Penn State

smsajal-2.JPG

San Jose, CA

smsajal116@gmail.com

I am a Performance Engineer at NetApp, where I work on feature performance engineering and research prototyping for the AI Data Engine (AIDE). Previously, I was a Performance and Capacity Engineer at Meta, where I worked on AI inference and training infrastructure and hardware power budget optimization.

I received my Ph.D. in Computer Science and Engineering from Pennsylvania State University in 2024, advised by Prof. Timothy Zhu and Prof. Bhuvan Urgaonkar. My dissertation, “Improving the Fidelity of Trace-Driven Experiments in Cloud Computing Systems,” introduced a family of trace scaling techniques now deployed in production at Azure.

My research sits at the intersection of cloud systems, performance engineering, and AI infrastructure. I publish at top systems venues — OSDI, EuroSys, ISCA, and ACM TOCS — and my work has earned a Best Paper Runner-Up award at EuroSys 2024.

Research areas: cloud computing · distributed systems · performance engineering · AI/ML infrastructure · systems evaluation methodology

news

Jan 06, 2026 Joined NetApp as a Performance Engineer, working on feature performance engineering and AI Data Engine (AIDE) research.
May 20, 2024 Joined Meta Platforms as a Performance and Capacity Engineer, working on AI inference and training infrastructure.
Apr 22, 2024 TraceUpscaler received the Best Paper Runner-Up Award at EuroSys 2024.
Apr 12, 2024 Defended my Ph.D. dissertation at Penn State: “Improving the Fidelity of Trace-Driven Experiments in Cloud Computing Systems.”
Mar 15, 2023 Kerveros accepted to OSDI 2023. The system is deployed in Azure production.

selected publications

  1. ACM TOCS
    TraceScaler: A Framework for Scaling Load in Real-World Traces for System Evaluation
    Sultan Mahmud Sajal, Md Salman Estyak, Rubaba Hasan, and 3 more authors
    ACM Transactions on Computer Systems, 2025
    Invited paper
  2. EuroSys
    TraceUpscaler: Upscaling Traces to Evaluate Systems at High Load
    Sultan Mahmud Sajal, Timothy Zhu, Bhuvan Urgaonkar, and 1 more author
    In Proceedings of the 19th European Conference on Computer Systems (EuroSys), 2024
  3. OSDI
    Kerveros: Efficient and Scalable Cloud Admission Control
    Sultan Mahmud Sajal, Luke Marshall, Beibin Li, and 8 more authors
    In Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2023
    Deployed in Azure production
  4. EuroSys
    TraceSplitter: A New Paradigm for Downscaling Traces
    Sultan Mahmud Sajal, Rubaba Hasan, Timothy Zhu, and 2 more authors
    In Proceedings of the 16th European Conference on Computer Systems (EuroSys), 2021