{"id":5303,"date":"2024-02-19T07:51:24","date_gmt":"2024-02-19T00:51:24","guid":{"rendered":"https:\/\/wam.vn\/en\/?p=5303"},"modified":"2024-02-19T07:51:24","modified_gmt":"2024-02-19T00:51:24","slug":"the-most-insane-week-of-ai-news-so-far-this-year","status":"publish","type":"post","link":"https:\/\/wam.vn\/en\/the-most-insane-week-of-ai-news-so-far-this-year\/","title":{"rendered":"The Most Insane Week of AI News So Far This Year!"},"content":{"rendered":"<p><em><strong>TLDR<\/strong>: Google&#8217;s DeepMind Gemini 1.5 &amp; OpenAI&#8217;s Sora revolutionize AI with multimodal understanding, text-to-video generation, &amp; memory features. #AI #DeepMind #OpenAI #Sora #Gemini1.5.<\/em><\/p>\n<p>This article is a summary of the YouTube video &#8220;The Most Insane Week of AI News So Far This Year!&#8221; by Matt Wolfe.<br \/>\n<iframe title=\"YouTube video player\" src=\"https:\/\/www.youtube.com\/embed\/ne7_PDthIYA?si=5z0Wt7lh-AfwcLdO\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<h3>Key Takeaways:<\/h3>\n<ol>\n<li><strong>Google DeepMind Announces Gemini 1.5<\/strong>: A new model built on a mixture-of-experts architecture, which improves efficiency by routing each prompt through smaller, specialized expert sub-networks rather than the full model.<\/li>\n<li><strong>Increased Context Window<\/strong>: Gemini 1.5 supports a context window of up to 1 million tokens, roughly 750,000 words of combined input and output text, surpassing the capacity of previous models.<\/li>\n<li><strong>Advanced Multimodal Understanding<\/strong>: Demonstrated by analyzing a 44-minute silent Buster Keaton movie and identifying plot points and details without any accompanying text.<\/li>\n<li><strong>Exceptional Text Analysis Precision<\/strong>: In tests, Gemini 1.5 found specific information within large text blocks (up to 1 million tokens) 99% of the time.<\/li>\n<li><strong>OpenAI&#8217;s Sora<\/strong>: A groundbreaking AI text-to-video 
model capable of generating realistic videos up to 60 seconds long from text prompts, showcasing superior realism in AI-generated content.<\/li>\n<li><strong>Sora&#8217;s Technical Capabilities<\/strong>: Includes generating videos from image prompts and seamlessly transitioning between video scenes, with potential for high-resolution image generation.<\/li>\n<li><strong>Memory Feature in ChatGPT<\/strong>: OpenAI introduces a memory feature, enabling ChatGPT to remember and use previous conversations for more contextually relevant interactions.<\/li>\n<li><strong>Andrej Karpathy&#8217;s New Projects<\/strong>: Following his departure from OpenAI, Karpathy hints at focusing on large-scale AI projects and educational content on his YouTube channel.<\/li>\n<li><strong>Stable Cascade Introduction<\/strong>: A new image-generation model from Stability AI that also supports image manipulation and enhancement, including features like inpainting and super-resolution.<\/li>\n<li><strong>Chat with RTX by Nvidia<\/strong>: A local, offline-capable large language model interface requiring an Nvidia RTX 30-series or newer GPU, emphasizing the importance of hardware in AI advancements.<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Google&#8217;s DeepMind Gemini 1.5 &#038; OpenAI&#8217;s Sora revolutionize AI with multimodal understanding, text-to-video generation, &#038; memory features. 
#AI #DeepMind #OpenAI #Sora #Gemini1.5.<\/p>\n","protected":false},"author":3,"featured_media":5304,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[40],"tags":[],"class_list":["post-5303","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","category-40","description-off"],"_links":{"self":[{"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/posts\/5303","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/comments?post=5303"}],"version-history":[{"count":1,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/posts\/5303\/revisions"}],"predecessor-version":[{"id":5305,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/posts\/5303\/revisions\/5305"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/media\/5304"}],"wp:attachment":[{"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/media?parent=5303"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/categories?post=5303"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wam.vn\/en\/wp-json\/wp\/v2\/tags?post=5303"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}