





















































It looks like the AI giants are battling it out, with announcements on new models, Gen-AI capabilities for their flagship products, and research breakthroughs. But don’t you worry, we’ve got you. Here is your weekly digest!
LLM Expert Insights Team,
Packt
Known for their pioneering research in reinforcement learning, Barto and Sutton have shaped AI agents, robotics, and gaming through decades of work. The 2024 Turing Award recognizes their profound contribution to AI and ML.
1. Deutsche Telekom’s AI phone
Deutsche Telekom’s upcoming AI phone, equipped with an AI assistant powered by Perplexity, will be available to the public later this year.
2. OPPO Announces Enhanced AI Strategy
OPPO has announced its AI strategy, featuring innovations like AI Call Translator and AI VoiceScribe to level up its mobile AI experiences.
3. Stability AI and Arm Bring On-Device Generative Audio to Smartphones
Stability AI and Arm’s partnership is set to enable high-quality sound effects and audio sample generation directly on mobile devices, with generation reported to run 30x faster on Arm CPUs.
4. Google Showcases Android’s AI and Gemini Features; Wins Two GLOMO Awards at MWC 2025
Google demoed Android AI Core, featuring smart replies and text summarization, powered by Gemini Nano. Google’s Gemini won the Breakthrough Device Innovation award, highlighting Google’s leadership in AI for mobile. Pixel Pro was also named Smartphone of the Year.
Google is testing AI Mode in Labs, an experimental search experience, for its Google One AI Premium subscribers. Powered by Gemini 2.0, AI Mode expands on AI Overviews, offering more advanced reasoning, thinking, and multimodal capabilities.
Gemini Live will support multilingual conversations in 45+ languages and expand Pixel’s multimodal capabilities, with support from Gemini Nano for on-device AI.
Cortical Labs, an Australian startup, introduced CL1, the world's first commercial biological computer, at MWC. This "body in a box" uses living human brain cells to grow neurons capable of learning and processing information biologically, consuming far less energy than traditional AI. This “Wetware-as-a-Service” computer is set to launch in the second half of 2025.
Now, Gemini can analyze live videos with its vision capabilities. Users can share their screen or stream videos directly from their device camera to receive real-time insights from Gemini. This update is expected to roll out for Google One AI Premium users later this month.
Opera is testing an AI agent integrated into its browser. With this native AI agent, Opera aims to offer efficiency and user control while assisting with browsing tasks.
Anthropic has closed a Series E funding round, bringing its post-money valuation to $61.5 billion. This funding will support Anthropic’s expansion plans and the development of next-generation AI technology.
To create a culture of transparency and trust in AI, Anthropic also launched the Transparency Hub to provide information about its AI models, safety research, model evaluations, and methodologies.
Apptronik and Jabil have teamed up to build and integrate humanoid robots for tasks like inspection, sorting, and delivery.
Sanctuary AI is equipping its Phoenix humanoid robots with tactile sensors to enhance dexterity and precision in handling delicate tasks. This upgrade will improve Phoenix’s manipulation capabilities for real-world applications by introducing a sense of touch.
Figure CEO Brett Adcock announced that Helix will enter alpha testing this year, with the humanoid expected to reach households earlier than anticipated.
The AWS Center for Quantum Computing has introduced Ocelot, a new quantum computing chip designed to make quantum computing more feasible. The Ocelot prototype aims to reduce the cost of quantum error correction by up to 90% compared to existing methods.
Alibaba has released QwQ-32B, a model trained with reinforcement learning. Designed to be highly performant, QwQ-32B reports results comparable to those of much larger models.
Google has now released its new AI agent for Colab in select countries and languages. Designed for users 18 and older, this Data Science Agent simplifies data analysis by automating Jupyter notebook creation from text prompts. It can handle tasks like data loading, library imports, exploratory analysis, and visualization code generation.
Cohere AI has introduced Aya Vision, a state-of-the-art vision model designed to bridge language gaps in AI, especially for multimodal tasks combining text and images. Aya Vision can perform image captioning, visual question answering, and text generation across 23 languages. Available in 8B and 32B parameter sizes, the model is accessible via open-source platforms and WhatsApp for research and non-commercial use.
Gitingest is an open-source tool that converts Git repositories into text for LLMs. It simplifies code analysis and AI solutions by providing a structured, prompt-friendly text digest of codebases. Features include smart formatting, statistics on file structure, and CLI/Python package usage.
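To make the idea of a "prompt-friendly text digest" concrete, here is a minimal sketch using only the Python standard library. It is not Gitingest's implementation or API: the function name `build_digest`, the `max_chars` parameter, and the `=== FILE: ... ===` separator are all hypothetical choices made for illustration. It walks a local directory, skips hidden folders such as `.git`, and returns an indented file tree plus the concatenated file contents.

```python
import os


def build_digest(root, max_chars=10_000):
    """Return (tree, content) text for the files under root.

    Hypothetical helper for illustration only; not Gitingest's API.
    """
    tree_lines = []
    chunks = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Skip hidden directories such as .git, and sort for stable output.
        dirnames[:] = sorted(d for d in dirnames if not d.startswith("."))
        rel_dir = os.path.relpath(dirpath, root)
        depth = 0 if rel_dir == "." else rel_dir.count(os.sep) + 1
        if rel_dir != ".":
            # Directory entries sit one level above the files they contain.
            tree_lines.append("  " * (depth - 1) + os.path.basename(dirpath) + "/")
        for name in sorted(filenames):
            tree_lines.append("  " * depth + name)
            path = os.path.join(dirpath, name)
            rel_path = os.path.relpath(path, root).replace(os.sep, "/")
            # Read at most max_chars characters; replace undecodable bytes.
            with open(path, "r", encoding="utf-8", errors="replace") as fh:
                text = fh.read(max_chars)
            chunks.append(f"=== FILE: {rel_path} ===\n{text}")
    return "\n".join(tree_lines), "\n\n".join(chunks)
```

Calling `tree, content = build_digest("path/to/repo")` yields two strings that can be pasted into an LLM prompt. A real tool would also need to handle binary files, ignore rules, and size limits, which this sketch leaves out.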
Wispr Flow is an AI voice dictation tool that uses real-time voice-to-text conversion to allow users to type up to three times faster. It features AI commands and auto-editing, and supports over 100 languages. Context-aware and adaptable to individual speech patterns, it caters to professionals, writers, and students, with tiered pricing options.
Google Research has introduced Confidential Federated Analytics (CFA), a privacy-preserving technique that prioritizes user privacy while discovering new words to improve search engines. CFA analyzes anonymized and aggregated search query data from numerous devices, without inspecting individual queries directly. This technique helps identify emerging words and trends, improving search quality, particularly for low-resource languages.
Selene-1 is a powerful LLM evaluator model equipped with absolute scoring, classification, and pairwise preference capabilities. With customizable evaluations and chain-of-thought critiques, Selene-1 can detect hallucinations and verify the accuracy of LLM responses.
Recently launched Sesame AI employs a Conversational Speech Model (CSM) to create human-computer interaction interfaces using speech and natural language.
OpenAI has launched NextGenAI, a consortium of 15 research institutions, backed by $50 million in grants, compute funding, and API access. This initiative supports students, educators, and researchers in pushing the boundaries of AI knowledge and preparing future AI leaders. Founding partners include Caltech, Duke, Harvard, MIT, Oxford, and more, alongside institutions like Boston Children's Hospital and the Boston Public Library.
DeepSeek AI has introduced SmallPond, a lightweight data processing framework designed for high-performance AI training and inference on large datasets. Built on DuckDB and DeepSeek's 3FS, it efficiently processes petabytes of data using distributed processing with Ray.
📢 If your company is interested in reaching an audience of developers, technical professionals, and decision makers, you may want to advertise with us.
If you have any comments or feedback, just reply to this email.
Thanks for reading and have a great day!