Tag: triton

  • Cloud Blog: Data loading best practices for AI/ML inference on GKE

    Source URL: https://cloud.google.com/blog/products/containers-kubernetes/improve-data-loading-times-for-ml-inference-apps-on-gke/ Source: Cloud Blog Title: Data loading best practices for AI/ML inference on GKE Feedly Summary: As AI models increase in sophistication, there’s increasingly large model data needed to serve them. Loading the models and weights along with necessary frameworks to serve them for inference can add seconds or even minutes of scaling…

  • Cloud Blog: LummaC2: Obfuscation Through Indirect Control Flow

    Source URL: https://cloud.google.com/blog/topics/threat-intelligence/lummac2-obfuscation-through-indirect-control-flow/ Source: Cloud Blog Title: LummaC2: Obfuscation Through Indirect Control Flow Feedly Summary: Written by: Nino Isakovic, Chuong Dong Overview This blog post delves into the analysis of a control flow obfuscation technique employed by recent LummaC2 (LUMMAC.V2) stealer samples. In addition to the traditional control flow flattening technique used in older versions, the…

  • Hacker News: Liger-kernel: Efficient triton kernels for LLM training

    Source URL: https://github.com/linkedin/Liger-Kernel Source: Hacker News Title: Liger-kernel: Efficient triton kernels for LLM training Feedly Summary: Comments AI Summary and Description: Yes Summary: The Liger Kernel is a specialized Triton kernel collection aimed at enhancing LLM (Large Language Model) training efficiency by significantly improving throughput and reducing memory usage. It is particularly relevant for AI…