Category: Uncategorized

Nvidia contributes foundational Blackwell platform designs to OCP: “Aimed at accelerating AI infrastructure development”

October 16, 2024

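Nvidia is contributing foundational elements of its Blackwell accelerated computing design to the Open Compute Project (OCP). It also plans to broaden NVIDIA Spectrum-X support for OCP standards. At this year's OCP Global Summit, held October 15 to 17, Nvidia is sharing key portions of the electro-mechanical design of the NVIDIA GB200 NVL72 system with the OCP community. These include the rack architecture for supporting higher compute density and networking bandwidth, compute and switch tray mechanical…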
Intel and AMD launch ‘x86 Ecosystem Advisory Group’ with 12 major IT companies

October 16, 2024

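According to the statement, the tech giants plan to collaborate on architectural interoperability and hope to “simplify software development” across x86, the world's most widely used architecture. “We stand at the cusp of the most significant shift in the x86 architecture and ecosystem in decades,” said Intel CEO Pat Gelsinger. “Together with AMD and the founding members of this advisory group, we can light up the future of computing…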
“Forgetting matters as much as learning”: IBM makes the case for ‘LLM unlearning’

October 16, 2024

Kim Martineau, a science writer at IBM Research, explained the need for and importance of large language model unlearning in a blog post on why LLMs are being taught to forget. The following is a summary. Machine unlearning is the opposite of machine learning. If machine learning lays the groundwork for training AI on diverse data so that it can remember and think like a human brain, machine unlearning, by contrast, such training…
New weak points in healthcare security: smart devices and the rise of ransomware

October 16, 2024

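Since the COVID-19 pandemic, cyberattacks targeting the remote side of healthcare services have surged. Security vendors and researchers have reported sharp increases in phishing, ransomware, web application attacks, and other threats aimed at healthcare providers. This year's Change Healthcare breach in particular made headlines and served as a wake-up call for healthcare executives. The trend has put a heavy burden on healthcare security organizations. Cybersecurity…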
UL’s leap into the genAI evaluation business raises key questions

October 16, 2024

UL Solutions, part of the UL enterprise that grew out of Underwriters Laboratories, on Monday jumped into the crowded genAI third-party evaluation service market, joining Stanford University and Microsoft, among many others, but with a more customized approach. The UL team will be asking questions as well as analyzing code. Some analysts and others in…
Apple Engineers Show How Flimsy AI ‘Reasoning’ Can Be

October 16, 2024

The new frontier in large language models is the ability to “reason” their way through problems. New research from Apple says it’s not quite what it’s cracked up to be.
Arch-Function LLMs promise lightning-fast agentic AI for complex enterprise workflows

October 15, 2024

Katanemo’s new Arch-Function LLMs promise 12x faster function-calling capabilities, empowering enterprises to build ultra-fast, cost-effective agentic AI applications.
Google AI Introduces Gemma-APS: A Collection of Gemma Models for Text-to-Propositions Segmentation

October 15, 2024

The increasing reliance on machine learning models for processing human language comes with several hurdles, such as accurately understanding complex sentences, segmenting content into comprehensible parts, and capturing the contextual nuances present in multiple domains. In this landscape, the demand for models capable of breaking down intricate pieces of text into manageable, proposition-level components has…
A New Study by OpenAI Explores How Users’ Names can Impact ChatGPT’s Responses

October 15, 2024

Bias in AI-powered systems like chatbots remains a persistent challenge, particularly as these models become more integrated into our daily lives. A pressing issue concerns biases that can manifest when chatbots respond differently to users based on name-related demographic indicators, such as gender or race. Such biases can undermine trust, especially in name-sensitive contexts where…
Neural Magic Unveils Machete: A New Mixed-Input GEMM Kernel for NVIDIA Hopper GPUs

October 15, 2024

The rapid growth of large language models (LLMs) and their increasing computational requirements have prompted a pressing need for optimized solutions to manage memory usage and inference speed. As models like GPT-3, Llama, and other large-scale architectures push the limits of GPU capacity, efficient hardware utilization becomes crucial. High memory requirements, slow token generation, and…