Devin is now generally available

AI_Distilled #80: Google introduces Gemini 2.0: A new AI model for the agentic era

Zapier connects the apps you use every day, so you can focus on what matters most - Free to start.

Welcome to AI_Distilled. Today, we’ll talk about:

TechWave:
Devin is now generally available
Google introduces Gemini 2.0: A new AI model for the agentic era
Meta Llama-3.3 70B-Instruct
Gemini Flash - Google DeepMind
I can now run a GPT-4 class model on my laptop

Awesome AI:
Retro Diffusion: The Future of Pixel Art is now
Magic Clips: Create Viral Clips From Long Videos, Instantly
soundfont-generator - a Hugging Face Space by erl-j
Pickle - Lifelike AI clones lip-syncing to your voice in real-time
Shortcut by Poised

12 days of OpenAI:
Day 5: Apple launches its ChatGPT integration with Siri
Day 4: OpenAI Canvas Kills Google Docs, Challenges VS Code & Cursor
Day 3: OpenAI has finally released Sora
Day 2: Reinforcement Fine-Tuning Research Program
Day 1: Introducing ChatGPT Pro

Secret Knowledge:
Hugging Face's Text Generation Inference v3 overview
Meet Willow, our state-of-the-art quantum chip
Grok Image Generation Release

This is our final edition of AI_Distilled for 2024, but don’t worry: we’ll be back with more insights and updates in January 2025. In the meantime, we’ve got a little holiday treat for you!

Packt has some exciting offers lined up to help you boost your tech skills and get ready for an amazing new year! It’s the perfect opportunity to relax, learn something new, and stay ahead in your field. Keep an eye out for these special holiday deals!

From all of us at the Packt Newsletters team, we wish you a joyful holiday season and a fantastic start to 2025.
See you next year!

Cheers,
Shreyans Singh
Editor-in-Chief, Packt

Stop worrying about your to-do list. Zapier connects the apps you use every day, so you can focus on what matters most. Start working more efficiently - Create your free account today.

⚡ TechWave: AI/GPT News & Analysis

Devin is now generally available
Devin, a powerful AI tool for engineering teams, is now generally available starting at $500 per month. With no seat limits, integrations for Slack, IDEs, and APIs, and direct support from Cognition's engineering team, Devin is designed to tackle small frontend bugs, create first-draft PRs, and perform targeted code refactors. Teams can collaborate with Devin via Slack for task management, use its IDE extension for code reviews, and guide it with feedback to refine its output.

Google introduces Gemini 2.0: A new AI model for the agentic era
Google unveiled Gemini 2.0, its next-generation AI model, designed for "agentic" capabilities, enabling AI to act proactively on behalf of users. The multimodal model can process and generate text, images, audio, and video while using tools like Google Search and code execution. Its experimental version, Gemini 2.0 Flash, is available to developers with enhanced performance and lower latency.

Meta Llama-3.3 70B-Instruct
Llama 3.3 is a powerful multilingual AI model developed by Meta, designed for generating text and assisting in conversations across multiple languages. With 70 billion parameters, it uses an advanced transformer architecture and is aligned with human preferences through fine-tuning methods like RLHF. The model supports multilingual text input and output, offering robust performance in tasks like coding, reasoning, and multilingual understanding.
It incorporates a long context window, tool use capabilities, and optimized inference using Grouped-Query Attention.

Gemini Flash - Google DeepMind
Gemini 2.0, developed by Google DeepMind, is a cutting-edge AI model designed for a new era of "agentic" experiences, where AI systems can perform tasks using memory, reasoning, and planning under human supervision. This model features enhanced capabilities like native tool usage, real-time multimodal understanding (text, images, video, and audio), image generation, and text-to-speech. It excels in low-latency scenarios, enabling applications like coding assistance, game navigation, and interactive learning experiences.

I can now run a GPT-4 class model on my laptop
Meta’s Llama 3.3 70B is a GPT-4 class language model that can run on consumer-grade laptops like a 64GB MacBook Pro M2. This showcases the rapid advances in AI model efficiency over the past two years, making high-quality AI tools more accessible than ever. By using tools like Ollama, users can now easily download and run these models locally, enabling powerful applications like text generation and coding assistance. The model has also performed competitively on benchmarks, cementing its position among leading LLMs. This progress highlights the potential for affordable, locally hosted AI, expanding its utility for developers and enthusiasts alike.

💻 Awesome AI: Tools for Work

Retro Diffusion: The Future of Pixel Art is now
Retro Diffusion is a cutting-edge platform designed by artists to simplify and enhance the process of creating pixel art. It offers specialized tools that eliminate common frustrations, enabling creators to focus on their artistry rather than technical hurdles.
With Retro Diffusion, artists can quickly achieve professional-level pixel art, transforming their creative visions with ease and efficiency.

Magic Clips: Create Viral Clips From Long Videos, Instantly
Magic Clips is an AI-powered platform that transforms long videos into engaging, viral short clips instantly without the need for manual editing. Simply upload a video or paste a link, and the AI selects the most captivating moments, adds captions, and arranges them into shareable content. With features like unlimited uploads, transcript navigation, and performance optimization, Magic Clips helps users create and repurpose content efficiently.

soundfont-generator - a Hugging Face Space by erl-j
Erl-j's Soundfont Generator is an AI tool that creates custom soundfonts based on text descriptions. Users simply input a prompt describing the desired audio (e.g., "hard bass" or "sparkly bells"), adjust the generation settings for quality or speed, and generate the soundfont. The tool allows users to preview the instrument using a virtual keyboard and export it as a downloadable SFZ soundfont package, compatible with SFZ samplers. Built on advanced audio models, it uses latent flow matching for faster, more efficient generation, making it a powerful resource for musicians and audio designers.

Pickle - Lifelike AI clones lip-syncing to your voice in real-time
Pickle lets you use a personalized AI clone to represent you in video calls, providing flexibility and freedom. Whether you're not camera-ready, multitasking, or taking a break, your clone seamlessly participates in meetings across any video platform. With customizable outfits and backgrounds, you can tailor your clone to suit your needs.

Shortcut by Poised
Shortcut is an AI-powered tool that transforms the way you work by enabling natural voice-based interaction instead of typing. It lets you ask questions, organize ideas, and create polished drafts of messages, emails, and documents instantly, maintaining your productivity flow.
With Shortcut, your spoken words are quickly refined into well-crafted text in your chosen tone (friendly, professional, or direct), eliminating the hassle of editing.

🔛 12 days of OpenAI

Day 5: Apple launches its ChatGPT integration with Siri
Apple has launched ChatGPT integration with Siri as part of its new iOS 18.2 update, enabling Siri to handle complex questions by seamlessly accessing OpenAI’s GPT-4o model with user permission. This marks a significant step in Apple's AI initiative, dubbed Apple Intelligence, which aims to enhance user experience with advanced tools like text rewriting, glowing Siri notifications, and app action capabilities coming next year. The integration prioritizes privacy, ensuring OpenAI doesn’t store user queries, and positions Apple as a leader in consumer AI while offering OpenAI exposure to millions of iPhone users.

Day 4: OpenAI Canvas Kills Google Docs, Challenges VS Code & Cursor
OpenAI has introduced Canvas, a new feature within ChatGPT that provides a split-screen interface for drafting, editing, and coding, aiming to compete with tools like Google Docs, VS Code, and Cursor. Users can write or code on one side while receiving real-time suggestions and feedback from ChatGPT on the other. This feature supports Python code execution, debugging, and syntax highlighting, making it a robust tool for developers and writers alike. Beyond basic editing, users can format text, address AI-generated comments, and generate visual outputs using Python.

Day 3: OpenAI has finally released Sora
OpenAI has launched Sora, a groundbreaking text-to-video AI tool, offering users the ability to create 1080p videos up to 20 seconds long with the $200/month ChatGPT Pro subscription, or shorter 720p videos with ChatGPT Plus. Users can generate videos from text, animate images, remix existing videos, and even blend scenes with AI. Sora includes features like a storyboard tool for precise frame-by-frame input and a community feed showcasing creations.
All videos come with watermarks and metadata to ensure transparency and prevent misuse.

Day 2: Reinforcement Fine-Tuning Research Program
OpenAI has launched the Reinforcement Fine-Tuning Research Program to enable developers and machine learning engineers to fine-tune AI models for domain-specific tasks. This technique involves training models using curated high-quality tasks and grading their responses against reference answers, improving reasoning and accuracy in specific fields like law, healthcare, and finance. Participants in the program gain alpha access to the Reinforcement Fine-Tuning API to test its potential on their use cases and provide feedback ahead of its public release in 2025.

Day 1: Introducing ChatGPT Pro
OpenAI has introduced ChatGPT Pro, a premium subscription plan costing $200 per month, offering enhanced access to its most advanced AI models and tools. This includes the powerful o1 Pro mode, which uses increased computational resources to provide more accurate and comprehensive answers, especially for complex tasks in data science, programming, and advanced research. External evaluations highlight its superior performance across challenging benchmarks like competitive math, coding, and science problems.

🚀 Secret Knowledge

Hugging Face's Text Generation Inference v3 overview
Hugging Face's Text Generation Inference (TGI) v3 delivers significant performance enhancements for handling large language models (LLMs). It processes three times more tokens and is 13 times faster than its competitor vLLM for long prompts, thanks to optimized memory usage, efficient prefix caching, and streamlined configurations that require no manual setup. TGI also improves hardware utilization, making it adaptable for both small-scale and high-performance deployments.
Benchmarks confirm these gains across various scenarios, showcasing faster responses for long conversations and complex prompts.

Meet Willow, our state-of-the-art quantum chip
Google's latest quantum chip, Willow, represents a significant leap forward in quantum computing, addressing long-standing challenges in error correction and performance. Willow demonstrates the ability to reduce errors exponentially as more qubits are added, solving a decades-old problem in quantum error correction. It also performed a benchmark computation in under five minutes, a task that would take the fastest classical supercomputers 10 septillion years, highlighting its unmatched processing power. With 105 qubits and breakthroughs in chip design, Willow is a major milestone toward building large-scale, practical quantum computers capable of tackling real-world problems and advancing scientific discovery.

Grok Image Generation Release
Grok's new image generation model, Aurora, brings cutting-edge capabilities to the 𝕏 platform, offering photorealistic rendering and precise adherence to text prompts. Trained on billions of text and image examples, Aurora supports multimodal input, enabling users to generate original images, edit existing ones, and create artistic or realistic visuals with exceptional detail.
Its versatility spans entity creation, artistic designs, and realistic human portraits.

📢 If your company is interested in reaching an audience of developers, technical professionals, and decision makers, you may want to advertise with us.

If you have any comments or feedback, just reply to this email.

Thanks for reading and have a great day!