





















































Still struggling to achieve work-life balance and manage your time efficiently?
Join this 3 hour Intensive Workshop on AI & ChatGPT tools (usually $399) but FREE for first 100 people.
Welcome to AI_Distilled. Today, we’ll talk about:
Techwave:
[Sponsored] Learn AI strategies & hacks that less than 1% people know
Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents
Tencent Unveils GameGen-O: AI Model for game development
OpenAI o1 is oficially smarter than 95%+ of humans
Introducing the Runway API for Gen-3 Alpha Turbo
Announcing Pixtral 12B by Mistral AI
Awesome AI:
Adobe Firefly Video Model preview
Thunderbit | Personalized Web AI Copilot
Verse: Make free digital pages
Masterclass:
GenAI for YouTubers- Google DeepMind
The Basics Behind AI Models for Self-Driving Cars
What is the Chinchilla Scaling Law?
Improve RAG performance using Cohere Rerank
MIT researchers have developed "Co-LLM"
HackHub:
Upscayl: free and open source AI image upscaler
Anthropic-quickstarts: build deployable applications using the Anthropic API
Multi-GPT: An experimental open-source attempt to make GPT-4 fully autonomous
Facebook Audioseal: Localized watermarking for AI-generated speech audios
💡Recommended Reading: Unlocking the Secrets of Prompt Engineering
Cheers!
Shreyans Singh
Editor-in-Chief, Packt
Salesforce has announced new innovations in Slack that turn AI agents into active teammates, enhancing productivity. New features include a unified work system that integrates Salesforce CRM data with Slack channels, AI-powered huddle notes, automation tools, and tailored templates for various tasks.
Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents
This update includes "Copilot Pages," a new collaborative workspace for AI and human interaction, allowing real-time editing and collaboration. Microsoft is also expanding Copilot's capabilities in Excel, now integrating Python for advanced data analysis, and in PowerPoint for more dynamic presentations. Additionally, Copilot in Teams and Outlook improves meeting and email management, while "Copilot Agents" automate business processes.
Tencent Unveils GameGen-O: AI Model for game development
Tencent has unveiled GameGen-O, an AI model designed to revolutionize game development by quickly generating vast and detailed open-world environments. This technology can use videos and images from the internet to create complex landscapes, reducing the need for manual data collection trips. GameGen-O aims to streamline the development process, allowing developers to focus on creativity while the AI handles the heavy lifting.
OpenAI o1 is oficially smarter than 95%+ of humans
OpenAI’s latest AI model, "o1," has demonstrated an IQ level higher than 95% of humans, according to recent testing by TrackingAI, a project that monitors AI intelligence across verbal and vision-based assessments. The project conducts regular evaluations of various AI systems using a range of tests, including Mensa-level IQ assessments. The performance of "o1" showcases the rapid advancements in AI capabilities.
Introducing the Runway API for Gen-3 Alpha Turbo
Runway has launched a new API for its Gen-3 Alpha Turbo model, allowing developers to integrate advanced AI capabilities into various applications and products.
Announcing Pixtral 12B by Mistral AI
Pixtral 12B is a new multimodal AI model that excels in both image and text understanding. It features a 400M parameter vision encoder and a 12B parameter multimodal decoder. Pixtral can handle different image sizes and aspect ratios, and process multiple images within a large context window of 128K tokens.
Learn how to integrate AI agents with databases using tools like LangChain and OpenAI.
It covers topics such as setting up AI agents, working with CSV and SQL databases, using OpenAI's function calling capabilities, and leveraging the Assistants API.
The course is designed for people with intermediate knowledge of Python and SQL, and it uses tools like Streamlit and LangChain.
Adobe Firefly Video Model preview
Adobe has introduced its new Firefly Video Model, a generative AI tool designed to enhance video editing within Adobe's software like Premiere Pro. It enables users to generate videos using text prompts, create atmospheric elements like fire or water, fill timeline gaps, and even bring still images to life.
Reddit Scout is a tool that quickly summarizes Reddit comments to help users find the best products to buy, saving time sifting through lengthy threads. It provides a detailed summary of discussions on various topics, such as smart home security systems, and is available as a Chrome extension.
This platform offers AI-generated audio discussions on various topics, transforming written content into engaging audio summaries. Each entry provides a concise audio summary of key papers and articles, making complex information easily accessible.
Thunderbit | Personalized Web AI Copilot
Thunderbit is an AI-powered tool designed to help business users automate various web tasks. It offers features like AI Web Clipper for extracting essential details from websites, voice note-taking to convert voice into structured notes, and AI-assisted data sync between business tables.
Verse: Make free digital pages
Verse is an app that turns your music taste into a visual representation of your personal space, like a digital bedroom inspired by the songs you listen to. It lets you explore and download creative content, from music and art to guides and reviews.
Empowering YouTube creators with generative AI - Google DeepMind
Google DeepMind is introducing generative AI tools, Veo and Imagen 3, to YouTube creators through a feature called Dream Screen. This will allow users to generate creative video backgrounds for YouTube Shorts by starting with a text prompt and choosing from four AI-generated images. Veo will then turn the selected image into a high-quality 6-second video clip.
The Basics Behind AI Models for Self-Driving Cars
This article explains how AI models for self-driving cars work by simulating driving behaviors using sensor data and a neural network. It outlines the basic mechanics: cars are equipped with sensors that detect proximity to objects in all directions, and the model uses this data to predict acceleration, braking, and steering. The neural network is trained on synthetic data that mimics human driving decisions, such as how much to turn or accelerate based on obstacles. A five-layer neural network built with PyTorch is used to train the model, which is evaluated based on its accuracy and crash rates.
What is the Chinchilla Scaling Law?
The Chinchilla Scaling Law, introduced in 2022, proposes that smaller language models can outperform larger ones if trained on significantly more data. Traditional models like GPT-3 increased in size without proportionally scaling the training data, leading to inefficiencies. The Chinchilla Scaling Law suggests an optimal balance between model size and data, showing that doubling the amount of data for every doubling of model size can maximize performance with the same compute resources.
Improve RAG performance using Cohere Rerank
Cohere Rerank helps improve RAG's performance by reordering retrieved documents based on a relevance score using deep learning. This second-stage process refines the results by aligning them more closely with user queries, boosting search accuracy and efficiency. Cohere Rerank can be integrated easily with tools like Amazon SageMaker.
MIT researchers have developed "Co-LLM"
MIT researchers have developed "Co-LLM," an algorithm that enables large language models (LLMs) to collaborate for more accurate and efficient solutions. It pairs a general-purpose model with a specialized expert model, with a "switch variable" that identifies when the general model needs help. This process allows the general model to handle most of the response, while the expert model steps in only when needed, improving accuracy and efficiency. The approach mimics how humans consult experts for specific tasks.
Upscayl is a free, open-source AI-powered image upscaler that lets you enhance and enlarge low-resolution images without losing quality. The tool uses advanced AI algorithms like Real-ESRGAN. You'll need a Vulkan-compatible GPU for best results.
Roop is an AI-based face-swapping tool that allows you to replace the face in a video with a face of your choice using just a single image—no training or large datasets required. Once set up, you can swap faces in videos by specifying source and target files through command-line options.
anthropics/anthropic-quickstarts
Anthropic Quickstarts is a set of projects that help developers easily build and deploy applications using the Anthropic API. These quickstarts offer a solid foundation for various applications, starting with a customer support agent powered by Claude, Anthropic's AI.
Multi-GPT is an experimental system where multiple specialized GPT models, known as "ExpertGPTs," work together to accomplish tasks. Each expert has its own memory (both short and long-term) and can communicate with other experts to solve complex problems. The system integrates advanced capabilities like internet searches, file storage, and long-term data recall. Users can interact with it by setting tasks, and the experts will collaborate autonomously to complete them, leveraging GPT-4 for text generation and optional tools like Pinecone for memory storage.
AudioSeal is a speech watermarking method that embeds invisible watermarks into audio, making it possible to detect watermarked segments even after editing. It uses a generator to create watermarks and a detector to find them in real-time with high accuracy, operating up to 100 times faster than existing models.
📢 If your company is interested in reaching an audience of developers and, technical professionals, and decision makers, you may want toadvertise with us.
If you have any comments or feedback, just reply back to this email.
Thanks for reading and have a great day!