Advance Program
Tutorials: Sunday, August 27, 2023
Time (PDT) | Title | Presenters |
---|---|---|
7:45AM-8:30AM |
Breakfast/Registration |
|
8:30AM-10:30AM |
ML Inference |
|
ML Inference Overview |
Micah Villmow, NVIDIA | |
Quantization Methods for Efficient ML Inference |
Amir Gholami, UC Berkeley | |
ML Inference at the Edge |
Felix Baum, Qualcomm | |
10:30AM-11:00AM |
Coffee Break (1/2 hr) |
|
11:00AM-12:15PM |
ML Inference |
|
PyTorch 2.0 |
Elias Ellison, Meta | |
Hardware Requirements for Exploiting Sparsity in ML Inference |
Zhibin Xiao, Moffett AI | |
12:15PM-1:30PM |
Lunch (1 hr 15 min) |
|
1:30PM-3:30PM |
Chiplets/UCI |
|
UCIe Overview & Usage Models |
Debendra Das Sharma, Intel & Nathan Kalyansundharam, AMD | |
UCIe Protocol |
Swadesh Choudhary, Intel & Marvin Denman, NVIDIA | |
3:30PM-4:00PM |
Coffee Break (1/2 hr) |
|
4:00PM-5:30PM |
Chiplets/UCI |
|
Electrical, Form Factor and Compliance |
Anwar Kashem, AMD & Gerald Pasdast, Intel | |
Software, Manageability & Security |
Jérôme Glisse, Google & Sridhar Muthrasanallur, Intel | |
5:30PM-7:00PM |
Reception |
Conference Day 1: Monday, August 28, 2023
Time (PDT) | Title | Presenters |
---|---|---|
7:45AM-8:45AM |
Breakfast/Registration |
|
8:45AM-9:00AM |
Opening Remarks |
|
General Chair Welcome |
Gabriel Southern & Ron Diamant | |
Progam Co-Chairs Welcome |
Natalia Vassilieva & Heiner Litz | |
9:00AM-10:00AM |
Keynote #1 |
|
Exciting Directions for ML Models and the Implications for Computing Hardware |
Jeff Dean & Amin Vahdat, Google | |
10:00AM-11:00AM |
Processing in Memory |
|
Memory-centric Computing with SK Hynix’s Domain-Specific Memory |
Yongkee Kwon, SK Hynix | |
Samsung AI-cluster system with HBM-PIM and CXL-based Processing-near-Memory for transformer-based LLMs |
Jin Hyun Kim, Samsung | |
11:00AM-11:05AM |
Sachs Memorial |
|
Sachs Memorial |
Alan J Smith, UC Berkeley | |
11:05AM-11:30AM |
Coffee Break |
|
11:30AM-1:00PM |
CPU 1 |
|
ARM’s Neoverse V2 platform: leadership performance and power efficiency for next-generation cloud computing, ML and HPC workloads |
Magnus Bruce, ARM | |
AMD Next Generation “Zen 4” Core and 4th Gen AMD EPYCTM 9004 Server CPU |
Kai Troester & Ravi Bhargava, AMD | |
Ventana’s Veyron V1 Data Center-Class RISC-V Processor |
Greg Favor, Ventana | |
1:00PM-2:15PM |
Lunch (1 hr 15 min) |
|
2:15PM-4:15PM |
Platforms |
|
Architecting for Flexibility and Value with future Intel® Xeon® processors |
Chris Gianos, Intel | |
CSS-Genesis: Arm’s Neoverse N2 platform, delivered to partners as a fully verified, customizable compute sub-system |
Anitha Kona, ARM | |
Intel® Energy Efficiency Architecture |
Efraim Rotem, Intel | |
Caliptra: An Open-Source Root of Trust for Measurements |
Bharat Pillilli, Microsoft | |
4:15PM-4:45PM |
Coffee Break (1/2 hr) |
|
4:45PM-6:15PM |
CPU 2 |
|
Intel® Xeon® processors built on Efficient-core (E-Core): The Next Generation of High Performance, Energy-Efficient Computing |
Don Soltis, Intel | |
AMD Ryzen 7040 Series Mobile Processor |
Mahesh Subramony & David Kramer, AMD | |
Detailed Architecture Analysis and Key Features of SiFive’s latest high-performance out-of-order Vector Processor |
Brad Burgess, SiFive | |
6:15PM-7:45PM |
Reception |
Conference Day 2: Tuesday, August 29, 2023
Time (PDT) | Title | Presenters |
---|---|---|
7:45AM-8:30AM |
Breakfast/Registration |
|
8:30AM-9:00AM |
Poster Lighting Talks (2 minutes/poster) |
|
Poster Lighting Talks |
||
9:00AM-10:00AM |
Keynote #2 |
|
Hardware for Deep Learning |
Bill Dally, NVIDIA | |
10:00AM-11:00AM |
ML-Training |
|
A Machine Learning Supercomputer With An Optically Reconfigurable Interconnect and Embeddings Support |
Norman Jouppi & Andy Swing, Google | |
Inside the Cerebras Wafer-Scale Cluster |
Sean Lie, Cerebras | |
11:00AM-11:30AM |
Coffee Break (1/2 hr) |
|
11:30AM-1:00PM |
Interconnects |
|
NVIDIA’s Resource Transmutable Network Processing ASIC |
Kevin Deierling, NVIDIA | |
Hummingbird Low-Latency Computing Engine |
Maurice Steinman, Lightelligence | |
The First Direct Mesh-to-Mesh Photonic Fabric |
Jason Howard, Intel | |
1:00PM-2:15PM |
Lunch (1 hr 15 min) |
|
2:15PM-4:15PM |
ML-Inference |
|
IBM NorthPole Neural Inference Machine |
Dharmendra Modha, IBM | |
Moffett Antoum: A Deep-Sparse AI Inference System-on-Chip for Vision and Large Language Models |
Zhibin Xiao, Moffet AI | |
Qualcomm® Hexagon™ NPU |
Eric Mahurin, Qualcomm | |
Supercharged AI inference on modern CPUs |
Lawrence Spracklen & Subutai Ahmad, Numenta | |
4:15PM-4:45PM |
Coffee Break (1/2 hr) |
|
4:45PM-6:15PM |
FPGAs & Cooling |
|
AMD Next Generation FPGA Built From Chiplets |
Dinesh Gaitonde, AMD | |
Intel’s Agilex-9 Direct RF FPGAs with Integrated 64 GSPS Data Converters |
Benjamin Esposito, Intel | |
High Performance Cold Plates for Data Center Thermal Management via Electrochemical Additive Manufacturing (ECAM) |
Ian Winfield, Fabric8Labs | |
6:15PM-6:30PM |
Closing Remarks |
|
Closing Remarks |
Gabriel Southern & Ron Diamant |
Posters
Title | Authors & Affiliation |
---|---|
TrustForge: A Cryptographically Secure Enclave for Azure and AWS | Todd Austin, Valeria Bertacco and Alex Kisil; Agita Labs |
An Open-Source 130-nm Fusion-Enabled Deconvolution Kernel Generator IC For Real-Time mmWave Radar Platform Motion Compensation | Nikhil Poole, Priyanka Raina and Amin Arbabian; Stanford University |
A Scalable Multi-Chiplet Deep Learning Accelerator with Hub-Side 2.5D Heterogeneous Integration | Zhanhong Tan, Yifu Wu, Yannian Zhang, Haobing Shi, Wuke Zhang and Kaisheng Ma; Tsinghua University |
PHEP: Paillier Homomorphic Encryption Processors for Privacy-Preserving Applications in Cloud Computing | Guiming Shi, Yi Li, Xueqiang Wang, Zhanhong Tan, Dapeng Cao, Jingwei Cai, Yuchen Wei, Zehua Li, Wuke Zhang, Yifu Wu, Wei Xu and Kaisheng Ma; Tsinghua University |
HyperAccel LPU: Accelerating Hyperscale Models for Generative AI | Seungjae Moon, Junsoo Kim, Jung-Hoon Kim, Junseo Cha, Gyubin Choi, Seongmin Hong and Joo-Young Kim; HyperAccel / KAIST |
An Abstract of SiMa.ai’s MLSoC Architecture | Srivi Dhruvanarayan; SiMa.ai |
Shaheen: An Open, Secure, and Scalable RV64 SoC for Autonomous Nano-UAVs. | Luca Valente, Asif Hussain Chiralil Veeran, Mattia Sinigaglia, Yvan Tortorella, Alessandro Nadalini, Nils Wistoff, Bruno Sà, Angelo Garofalo, Rafail Psiakis, Mohammed Tolba, Ari Kulmala, Nimisha Limaye, Ozgur Sinanoglu, Sandro Pinto, Daniele Palossi, Luca Benini, Baker Mohammad and Davide Rossi; University of Bologna |
A Heterogeneous SoC for Bluetooth LE in 28nm | Felicia Guo, Nayiri Krzysztofowicz, Alex Moreno, Jeffrey Ni, Daniel Lovell, Yufeng Chi, Kareem Ahmad, Sherwin Afshar, Josh Alexander, Dylan Brater, Cheng Cao, Daniel Fan, Ryan Lund, Jackson Paddock, Griffin Prechter, Troy Sheldon, Shreesha Sreedhara, Anson Tsai, Eric Wu, Kerry Yu, Daniel Fritchman, Aviral Pandey, Ali Niknejad, Kristofer Pister and Borivoje Nikolic; University of California Berkeley |
Driving Compute Scale-out Performance with Optical I/O Chiplets in Advanced System-in-Package Platforms | Mark Wade, Chen Sun, Matt Sysak, Vladimir Stojanović, Pooya Tadayon, Ravi Mahajan and Babak Sabi; Ayar Labs |
A Heterogeneous RISC-V SoC for ML Applications in Intel 16 Technology | Yufeng Chi, Franklin Huang, Raghav Gupta, Ella Schwarz, Jennifer Zhou, Reza Sajadiany, Animesh Agrawal, Max Banister, Michelle Boulos, Jason Chandran, Jessica Dowdall, Leena Elzeiny, Claire Gantan, Anthony Han, Roger Hsiao, Chadwick Leung, Edwin Lim, Jose Rodriguez, Tushar Sondhi, Mitchell Twu, Rongyi Wang, Mike Xiao, Ruohan Yan, Paul Kwon, Zhaokai Liu, Jerry Zhao, Bob Zhou, Ali Niknejad, Kristofer Pister and Borivoje Nikolić; University of California, Berkeley |