Article • Oct 31, 2024
Phala Network Study Shows Impact of TEE on NVIDIA H100 and H200 Hopper GPUs

Summary
- TEE-on mode affects Time To First Token (TTFL) and Inter-Token Latency (ITL) differently in H100 and H200 GPUs.
- For typical LLM tasks, TEE introduces under 7% performance overhead with larger models experiencing nearly zero impact.
📣 Related news
Loading news...




