AI Tools Nav
HomeToolsCompareGuideNewsSkills
中
AI Tools Nav

Curated AI tools directory — from choosing to mastering, all in one place.

RSSAPI

Navigation

  • Home
  • Tools
  • Compare
  • Guide
  • News
  • Skills

Platform

  • Overview
  • API
  • RSS
  • Submit

About

  • About Us
  • Changelog
© 2026 AI Tools Nav - AI Tools Directory
AI News

AI News Brief | 2026-05-18

OpenAI unveils GPT-5.5 and merges ChatGPT with Codex; Cursor eyes $50B valuation; ElevenLabs hits $500M ARR; DeepSeek, Kimi, and others race for funding; and AI deepfakes spark global concern.

2026-05-18

AI Industry Overview

A fresh wave of concern over AI-generated deepfakes is being labeled a “new wave of violence” against women globally, as non-consensual synthetic media spreads rapidly. Meanwhile, EU lawmakers are assessing readiness for emerging cyber-capable AI models, and new data suggests a quarter of recent layoffs have been attributed to AI automation. On the product side, Coupa launched Compose and Catalyst for agentic AI, Mixpanel introduced always-on product intelligence, and OpenAI officially introduced GPT-5.5, marking a new generation of its flagship model. Startup activity remains intense, with a wave of AI product launches in May reshaping coding, search, and enterprise automation.

OpenAI & ChatGPT

OpenAI dropped a barrage of announcements. It merged ChatGPT and Codex under Greg Brockman, signaling a unified agentic platform. ChatGPT also gained new personal finance tools, aiming to become your money manager. On the model front, GPT-5.5 Instant was unveiled as a smarter, clearer, and more personalized variant. Meanwhile, the DALL·E API shut down on May 12, pushing users to migrate to GPT Image 2. OpenAI also officially discontinued Sora, its AI video generator, consolidating its creative tools portfolio.

Cursor

The AI coding editor is reportedly in talks to raise $2 billion at a $50 billion valuation after hitting $2B ARR in just three years. Cursor 3.3 shipped with build in parallel, split PRs, and unified PR review, further streamlining the developer workflow.

ElevenLabs

Voice AI unicorn ElevenLabs has blown past $500 million in annual recurring revenue and is adding an engineer to every team—including sales and legal—as part of a “vibe coding” push, its CEO told Business Insider. The company also announced additional funding alongside the milestone.

DeepSeek

The Chinese AI lab rolled out image recognition ahead of its V4.1 update and is reportedly seeking a record $7.3 billion funding round. However, its latest model release didn’t wow markets, which are growing accustomed to rapid-fire innovation.

Claude (Anthropic)

A severe RCE flaw in Claude Code was disclosed that could let attackers execute commands via malicious deeplinks. On a brighter note, Anthropic held its Code w/ Claude SF 2026 event, showcasing exponential AI building. PwC expanded its partnership, deploying Claude across technology, deals, and enterprise reinvention.

Google Gemini

Google is racing to put Gemini at the center of Android before Apple’s AI reboot. The Gemini app added a ‘thinking level’ control and integrations with Canva, Instacart, and OpenTable. Additionally, Gemini Intelligence brings proactive AI features to Android devices.

GitHub Copilot

Microsoft’s coding assistant restructured individual plans: Pro and Pro+ now include flex allotments, and a new Max plan was introduced. Developers can now also start cloud agent tasks via REST API, expanding automation capabilities.

Kimi (Moonshot AI)

The Chinese startup launched Kimi WebBridge, an open-source tool that turns AI into a local browser operator, automating complex web tasks. It’s also raising funds at a $20 billion valuation, signaling intense capital interest in Chinese AI.

Kling AI

The viral “Korean Baseball Trend” pushed Kling AI to the top of App Store charts in 42 countries. The company launched a browser-based platform for Kling 3.5 and introduced Kling API 3 for native 4K video production workflows.

Bolt.new

StackBlitz’s Bolt.new deepened its Pica integration (“Everything is connected”) and announced agentic building on Azure and Microsoft 365 for enterprise. Bolt for Teams also received upgrades, expanding collaborative AI-powered development.

Devin

Cognition’s AI software engineer jumped from Devin 2.1 to Devin 2.2, adding enhanced reasoning and tool use capabilities. The release notes detail an evolving platform that aims to fully automate mundane engineering tasks.

Perplexity

Perplexity expanded its financial data reach via a Morningstar integration and launched a personal computer AI agent for Mac that rivals local assistants, continuing its push toward an all-purpose answer engine.

Notion AI

Notion introduced its Developer Platform and shipped version 3.5, which includes new APIs and customization options. Admins also gained new custom agent controls for enterprise governance.

ERNIE (Baidu)

Baidu released ERNIE 5.1, topping multiple leaderboards with a model that writes better and understands users more deeply—all while pre-training costs were slashed to just 6% of industry peers.

Grok (xAI)

Elon Musk’s Grok appears to be losing momentum: a model deprecation was announced, and reports surfaced that Anthropic is taking over SpaceX compute power. The news raises questions about xAI’s competitiveness.

Qwen (Alibaba)

Alibaba integrated Qwen with Taobao for a conversational shopping experience. Qwen is also pushing image AI forward by innovating on the compression layer, potentially capturing more detail from prompts.

Hailuo AI (MiniMax)

Hailuo released Start & End Frames for its video model, mastering complex instructions and physics. It also launched a Video Agent in beta for zero-touch vibe video creation, and showcased AI creativity at Cannes.

HeyGen

The AI video avatar platform continued shipping, with April 2026 updates focused on improved avatar expressions and lip-sync. Earlier updates in February and January rounded out a strong year so far.

Descript

Descript opened its API in beta, enabling programmatic audio/video editing workflows. This marks a significant step toward embedding its editor into third-party tools.

Gamma

Gamma launched Gamma Imagine, an AI-native design tool for presentations, taking on Canva and Adobe with integrated image generation.

Midjourney

The image generation leader is teasing V8 Alpha, with updates appearing on its changelog. Details remain thin, but the community is watching closely.

Pika

Pika raised $80M in a Series B round, bringing total funding to $135M. The AI video tool from Stanford dropouts continues to outperform OpenAI’s Sora in certain benchmarks.

Flux AI

Flux AI launched CRAISEE Teams Enterprise, a scalable generative AI platform for organizations, expanding from PCB design into broader enterprise AI imagery.

Consensus

Consensus, the AI research search engine, raised $30M and introduced Pro Analysis, enhancing its ability to surface and summarize academic insights.

iFlyrec (iFlytek)

iFlytek’s audio transcription tool surpassed 100 million users. The company also applied multimodal AI to pig farming, boosting efficiency and disease detection—a quirky but impactful industrial use.

Doubao (ByteDance)

ByteDance’s Doubao explored paid tiers up to 500 yuan and launched Doubao-Seed-2.0-lite, a full-modal model that can listen, watch, and act—further monetizing its consumer AI app.

MetaSo

The semantic search engine MetaSo published documentation and a roadmap, but no major product launches were reported, indicating slow but steady development.


Sources are linked inline throughout the brief. For a full list of references, see the original search results.

Tools Mentioned

ChatGPT→Cursor→ElevenLabs→DeepSeek→Claude→Gemini→GitHub Copilot→Kimi→可灵→Bolt.new→Devin→Perplexity→Notion AI→DALL·E 3→文心一言→Grok→通义千问→海螺 AI→HeyGen→Descript→Gamma→Ideogram→Midjourney→Pika→Sora→Flux→Consensus→讯飞听见→豆包→秘塔 AI→
Featured
C
Freemium

ChatGPT

OpenAI's conversational AI assistant supporting text generation, coding, creative writing, and more.

ChatChatWritingCoding
Featured
C
Freemium

Cursor

VS Code-based AI-first code editor with powerful AI completions, inline editing, and codebase chat.

AgentEditorCodingCode Completion
Featured
E
Freemium

ElevenLabs

Leading AI voice synthesis platform supporting multilingual text-to-speech and voice cloning.

AudioTTSVoice CloneMultilingual
Featured
D
Freemium

DeepSeek

High-performance LLM from DeepSeek achieving GPT-4-level performance at a fraction of the cost, supporting 128K context and deep reasoning.

ChatChatCodingReasoning
Featured
C
Freemium

Claude

Anthropic's AI assistant, excels at long-text analysis, code review, and complex reasoning.

ChatChatCodingAnalysis
Featured
G
Freemium

Gemini

Google's multimodal AI model, deeply integrated with Google ecosystem, supporting text, image, and code understanding.

ChatChatMultimodalSearch
Featured
G
Freemium

GitHub Copilot

GitHub's AI coding assistant deeply integrated with VS Code, JetBrains, and other IDEs, supporting code completion and conversational coding.

AgentCode CompletionIDE IntegrationCoding
Featured
K
Freemium

Kimi

Moonshot AI's assistant known for ultra-long context (2M characters), excelling at document analysis, long-form summarization, and deep research.

ChatChatLong ContextAnalysis
Featured
K
Freemium

Kling

Kuaishou's AI video generation tool, creating high-quality short videos from text and images with realistic physics effects.

VideoVideo GenShort VideoPhysics
Featured
B
Freemium

Bolt.new

StackBlitz's browser-based AI full-stack app generator that creates runnable web apps from prompts with one-click deploy.

AgentCodingFull StackDeploy
Featured
D
Paid

Devin

Cognition AI's fully autonomous AI software engineer that independently handles the full dev cycle from requirements to deployment.

AgentCodingAutomationFull Stack
Featured
P
Freemium

Perplexity

AI search engine combining LLM with real-time web search, providing accurate answers with citations and deep research mode.

SearchSearchResearchReal-time
Featured
N
Paid

Notion AI

Notion's built-in AI features for writing assistance, summarization, translation, and brainstorming.

OfficeWritingSummarizationCollaboration
D
Freemium

DALL·E 3

OpenAI's image generation model, excels at understanding complex prompts and creating accurate, creative images.

ImageImage GenCreative
E
Freemium

ERNIE

Baidu's knowledge-enhanced LLM, deeply integrated with Baidu Search, excelling in Chinese language understanding and knowledge Q&A.

ChatChatKnowledgeChinese
G
Freemium

Grok

xAI's AI assistant featuring real-time X platform data access, humorous conversational style, and deep thinking mode.

ChatChatReal-timeReasoning
Featured
Q
Freemium

Qwen

Alibaba's LLM series covering chat, coding, multimodal, and more, supporting long context and complex reasoning.

ChatChatCodingMultimodal
H
Freemium

Hailuo AI

MiniMax's AI voice and creation platform, supporting voice synthesis, AI music, and intelligent conversation.

AudioTTSMusicChat
H
Paid

HeyGen

AI avatar video generation platform supporting virtual hosts, video translation, and lip-sync, widely used in marketing and education.

VideoAvatarVideo TranslateLip Sync
D
Freemium

Descript

AI-powered audio/video editor that lets you edit podcasts and videos like documents, with auto-transcription, filler removal, and AI voiceover.

AudioPodcastVideo EditTranscription
G
Freemium

Gamma

AI-powered presentation and document creation tool for quickly generating polished slides and web docs.

OfficeSlidesDocsPPT
I
Freemium

Ideogram

AI image generation tool specializing in text rendering, capable of accurately generating clear text and logos within images.

ImageImage GenText RenderLogo
Featured
M
Paid

Midjourney

Top-tier AI image generation tool, renowned for artistic style and high-quality output.

ImageImage GenArtDesign
Featured
P
Freemium

Pika

AI video generation tool by Pika Labs, supporting text/image-to-video with rich creative effects and easy operation.

VideoVideo GenCreativeText-to-Video
Featured
S
Paid

Sora

OpenAI's text-to-video model, capable of generating high-quality videos up to one minute long.

VideoVideo GenText-to-Video
Featured
F
Freemium

Flux

Image generation model by the original Stable Diffusion team at Black Forest Labs, with industry-leading image quality and text rendering.

ImageImage GenHigh QualityText Render
C
Freemium

Consensus

AI academic search engine that extracts and summarizes findings directly from research papers.

SearchAcademicPapersResearch
i
Freemium

iFlyrec

iFlytek's speech-to-text platform, supporting real-time voice recognition, meeting transcription, and multi-language translation.

AudioSpeech-to-TextMeetingTranslation
Featured
D
Free

Doubao

ByteDance's AI assistant with text-to-image, voice chat, web search and other multimodal capabilities, excellent Chinese experience.

ChatChatMultimodalSearch
Featured
M
Free

MetaSo AI

AI search engine providing structured answers and mind maps, ideal for deep research and knowledge organization.

SearchSearchResearchMind Map
← Back to AI News