Tag: network
-
Cloud Blog: Data loading best practices for AI/ML inference on GKE
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/improve-data-loading-times-for-ml-inference-apps-on-gke/ Source: Cloud Blog Title: Data loading best practices for AI/ML inference on GKE Feedly Summary: As AI models increase in sophistication, there’s increasingly large model data needed to serve them. Loading the models and weights along with necessary frameworks to serve them for inference can add seconds or even minutes of scaling…
-
Cloud Blog: 65,000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting/ Source: Cloud Blog Title: 65,000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models Feedly Summary: As generative AI evolves, we’re beginning to see the transformative potential it is having across industries and our lives. And as large language models (LLMs) increase in size — current models are reaching…
-
Cloud Blog: Unlocking LLM training efficiency with Trillium — a performance analysis
Source URL: https://cloud.google.com/blog/products/compute/trillium-mlperf-41-training-benchmarks/ Source: Cloud Blog Title: Unlocking LLM training efficiency with Trillium — a performance analysis Feedly Summary: Rapidly evolving generative AI models place unprecedented demands on the performance and efficiency of hardware accelerators. Last month, we launched our sixth-generation Tensor Processing Unit (TPU), Trillium, to address the demands of next-generation models. Trillium is…
-
The Register: HPE goes Cray for Nvidia’s Blackwell GPUs, crams 224 into a single cabinet
Source URL: https://www.theregister.com/2024/11/13/hpe_cray_ex/ Source: The Register Title: HPE goes Cray for Nvidia’s Blackwell GPUs, crams 224 into a single cabinet Feedly Summary: Meanwhile, HPE’s new ProLiant servers offer choice of Gaudi, Hopper, or Instinct acceleration If you thought Nvidia’s 120 kW NVL72 racks were compute dense with 72 Blackwell accelerators, they have nothing on HPE…
-
The Register: China’s Volt Typhoon crew and its botnet surge back with a vengeance
Source URL: https://www.theregister.com/2024/11/13/china_volt_typhoon_back/ Source: The Register Title: China’s Volt Typhoon crew and its botnet surge back with a vengeance Feedly Summary: Ohm, for flux sake China’s Volt Typhoon crew and its botnet are back, compromising old Cisco routers once again to break into critical infrastructure networks and kick off cyberattacks, according to security researchers.… AI…
-
The Register: HTTP your way into Citrix’s Virtual Apps and Desktops with fresh exploit code
Source URL: https://www.theregister.com/2024/11/12/http_citrix_vuln/ Source: The Register Title: HTTP your way into Citrix’s Virtual Apps and Desktops with fresh exploit code Feedly Summary: ‘Once again, we’ve lost a little more faith in the internet,’ researcher says Researchers are publicizing a proof of concept (PoC) exploit for what they’re calling an unauthenticated remote code execution (RCE) vulnerability…