• Human In The Loop
  • Posts
  • AI code review revolution 🤖, Gemini 2.5 Pro’s video vision 📱, Midjourney animates images 🎬

AI code review revolution 🤖, Gemini 2.5 Pro’s video vision 📱, Midjourney animates images 🎬

🛠️ Product Updates

Coding

In a strategic nod to the AI-enhanced coding landscape, Qodo Merge Pro (Qodo Merge 1.0) debuts with Retrieval-Augmented Generation for smarter pull request summaries. This doesn't surprise but aligns with ongoing industry trends addressing manual review inefficiencies. For developers, the buzzword fatigue ends with practical gains—context-aware merging improves code quality, accelerates reviews, and relieves burnout. Amid fierce competition, Qodo's blend of AI sophistication hints at setting new benchmarks, ensuring AI isn't just a helper, but a hero in CI/CD pipelines.

GitHub has launched Copilot Spaces in public preview, enabling developers to turn Copilot into a savvy team insider by packaging code, documents, and custom instructions in reusable "spaces." This leap addresses AI's historical blind spot: organizational knowledge. Unlike its competitors, GitHub deftly integrates this context directly with its platform, aligning Copilot closely with project needs while respecting permissions. For developers, this means more precise code suggestions and enhanced collaborative capabilities, potentially giving Copilot Spaces the edge in the bustling AI assistant arena.

GitHub Copilot has upgraded for Visual Studio 2022 and Visual Studio Code, unveiling Agent mode, Next Edit Suggestions (NES), and Model Context Protocol (MCP) integration. These features elevate Copilot from a mere autocomplete tool to an AI-powered coding assistant, highlighting Microsoft's strategic edge. Agent mode automates coding tasks, while MCP expands integration with external tools. For developers, this means smarter, context-aware coding, notably advancing .NET productivity in Visual Studio. VS Code, meanwhile, leans into extensibility, enhancing its appeal to diverse coding preferences.

Text

Google rolls out Gemini 2.5 Pro and Flash with video upload and analysis capabilities, marking a strategic leap in multimodal AI. Users can now upload video clips on Android and web apps, allowing Gemini to summarize, identify elements, or provide answers—tasks ChatGPT can’t yet manage with uploaded videos. While real-time analysis remains out of reach, Google’s offering aligns with their roadmap, expanding beyond competitors and setting the stage for broader enterprise adoption. And yes, your smartphone just got a new set of eyes.

Anthropic's Claude 4 Opus debuts as a frontrunner in multimodal AI, excelling in interpreting diverse data formats—text, images, tables, code, audio, and video—with pronounced safety and ethical standards. Surpassing rivals such as OpenAI's ChatGPT, Google's Gemini, and xAI's Grok in document reasoning and compliance, Claude 4 Opus targets sensitive fields like healthcare and finance. Its integration into Google Cloud's Vertex AI shows Anthropic's commitment to broad accessibility and robust cloud deployment, underscoring the growing trend of hybrid AI workflows. Who knew AI ethics could be this competitive?

Quora’s Poe AI is breaking onto the scene by aggregating multiple AI models, such as ChatGPT, Claude, and Llama, into a single interface, offering users a versatile chat experience. While this multi-model approach caters to casual users valuing diversity over depth, OpenAI’s ChatGPT stays ahead with tools like code execution and DALL·E image generation. Poe provides free access with limits, or for $19.99/month, tempting those who enjoy AI window shopping rather than diving deep into single-model ecosystems.

Google's Gemini AI Pro and Ultra have introduced "Scheduled Actions," allowing Android users to automate routine tasks like daily summaries or morning briefings at specific times. This feature, exclusive to premium subscribers, replaces Google Assistant's Routines, marking a strategic shift towards more personalized digital assistance. As competition with AI giants like OpenAI heats up, Google aims to stand out by embedding advanced automation into Gemini. Though your coveted to-do list could now manage itself, it won't come without a subscription.

Google's unveiled "Search Live with voice support" in the U.S., a fresh feature within its AI Mode experiment, lets users chat smoothly with Google Search much like AI chatbots. This move seems like a direct response to rivals such as ChatGPT and Gemini Live, aiming for dominance in conversational search. For the tech-savvy, this means hands-free interaction and keeping up with the conversation without reopening new searches—a boon when multitasking is your middle name.

Anthropic has unveiled four new beta capabilities for its API, enhancing the Claude Opus 4 and Sonnet 4 models. Highlights include a sandboxed code execution tool for running Python scripts and a seamless MCP connector for integrating with services like Zapier and Asana. With document management and extended caching features, developers can build AI agents with elevated functionality. Anthropic is strikingly broadening its reach by addressing practical developer needs with these versatile tools, aiming to outpace competitors in the AI agent space.

Image

Midjourney enters the AI video market with V1, an extension of its image model that animates pictures into 5- to 20-second clips using text prompts. While this model lacks sound and advanced editing, its integration into existing subscriptions and roughly eightfold cost over image generation offer users a creative twist on their current toolkit. Launch timing, amid Disney and Universal's copyright lawsuits, suggests Midjourney is both ambitious and undeterred by potential legal hurdles—after all, who needs sound when you’re making waves?

Agents

Quora's Poe AI update cleverly aggregates AI models such as ChatGPT, Claude, and Gemini into a single mobile-friendly chat interface. This strategic pivot allows users to sample different models with ease, a nod to the growing demand for variety in the bustling AI landscape. Unlike OpenAI's ChatGPT, which offers deep integration with advanced tools for $20/month, Poe keeps it simple and affordable at $19.99/month. As tech-savvy users juggle choices, Poe shines in its simplicity and flexibility for casual exploration and light customization.

đź§Ş Use Cases

Text

Oracle teams up with Elon Musk’s xAI to offer the Grok 3 AI model on Oracle Cloud Infrastructure (OCI), transforming enterprise AI tasks like content creation and business automation. This integration reflects Oracle's strategy to be a "neutral platform," integrating top-tier AI models rather than developing its own, thereby boosting customer choice and flexibility. Telecommunication provider Windstream is already exploring these models to streamline operations, highlighting the practical impact for businesses needing agile AI solutions. Who knew AI could work this seamlessly?

The Laver Cup has teamed up with Perplexity, an AI answer engine, as its Official Global Answer Engine starting with the 2025 tournament. Known for processing over 650 million questions monthly, Perplexity will enhance fan engagement by delivering real-time match insights, scores, and statistics, bringing tech-driven depth to the viewer experience. While tennis purists might fear AI overreach, this collaboration pushes fan interaction into new territory, making AI an ace up sports fans’ sleeves—far from love-all, it seems.

Coding

Qodo.ai is enhancing its code review process with AI-powered Qodo Merge, incorporating RAG for context-aware reviews. Strategic insight? By embracing AI, they're joining industry leaders like GitHub Copilot in the AI-driven code improvement race—a competition heating up as everyone seeks a smarter edge. The practical upshot: developers can now catch more bugs and improve code quality faster, focusing on significant issues rather than mundane checks. Trust an AI-assisted review to make your next release smoother and faster.

 

Reply

or to participate.