Image by Ideogram
Story: Ideogram AI has released Ideogram 2.0, a text-to-image model that aims to compete with or even outperform rivals like the much-hyped Flux model. It is cutting edge in generating realistic images, graphic designs, typography, and more. The new model, available on ideogram.ai and in the new iOS app, offers enhanced control over image style, color palette, and aspect ratios, empowering users to create stunning visuals with ease.
Key Findings:
Superior Performance: Ideogram 2.0 outperforms other text-to-image models in image-text alignment, subjective preference, and text rendering accuracy.
Diverse Styles: Users can choose from distinct styles, including Realistic, Design, 3D, and Anime, tailored to render unique genres of images.
Enhanced Text Accuracy: The Design style significantly improves text accuracy in generated images, ideal for creating premium graphic designs for various purposes.
Cost-Effective API: The Ideogram API offers superior image quality at a lower cost compared to other offerings, enabling developers to integrate Ideogram into their applications.
Pixit’s Two Cents: With its ability to create realistic images and striking graphic designs, Ideogram 2.0 is set to be a contender for our new best image model. The granular control over style, color palette, and aspect ratios is a unique addition to the usual control mechanisms. Most impressive to us is its ability to generate long-form text (see cover image), which makes it far superior to all others at the moment, including Flux.
Story: As of August 2024, the European startup ecosystem is experiencing a surge in AI investments, with 14 companies securing funding rounds valued at $100 million or more. In fact, AI startups have been at the forefront of innovation, with over 1,700 funding rounds taking place across the region this year alone. Here’s an overview of the biggest investment rounds in European AI this year.
Key Findings:
Wayve - $1 billion: Wayve is a UK-based autonomous driving startup that raised $1 billion this year, one of the largest AI deals in Europe. Wayve sells its AI technology to a variety of carmakers, rather than making the vehicles itself.
Mistral - $650 million and $431 million: Mistral is a France-based company building large language models in Europe. One of its unique selling points has been its embrace of open source.
Helsing - $484 million: Helsing is an AI defense company with operations in the UK, Germany, and France. Its focus is on developing AI software to make military forces more effective and protect democracies.
Poolside - $400 million: Poolside recently relocated its headquarters to Paris. It builds AI tools to help developers speed up software development, with the goal of eventually allowing non-coders to create applications.
DeepL - $320 million: DeepL is a Cologne-based startup that offers B2B AI-based text translation and writing tools.
H - $220 million: H (for heady) has not yet launched any products, but the company focuses on building AI agents for task automation and decision-making to boost worker productivity.
Flo Health - $200 million: Flo Health is based in London. The company is building a women’s health tracking app (period and ovulation).
Pigment - $145 million: Pigment is based in Paris, France. The startup provides AI-powered enterprise resource planning software for finance teams.
Pixit’s Two Cents: We love to see that Europe is investing in AI companies. For companies in the AI space, these developments indicate a promising landscape for innovation and growth.
Image by Google
Story: Google introduced Gemini Live, a powerful voice interaction feature for its Gemini platform, at the recent Pixel 9 event. Exclusive to Gemini Advanced subscribers, Gemini Live enables users to engage in free-flowing conversations with the AI assistant using natural language (just like the highly anticipated ChatGPT feature), allowing for interruptions and the ability to pause and resume discussions seamlessly.
Key Findings:
Natural Voice Interactions: Gemini Live allows users to engage in free-flowing conversations with the AI assistant, similar to ChatGPT's voice chat functionality.
Interrupt and Resume: Users can interrupt the assistant mid-response or pause discussions and resume them later, enhancing the natural flow of conversation.
Background Functionality: Gemini Live can operate in the background or while the device is locked, providing a seamless user experience.
Diverse Voice Options: Google has introduced ten new voices for users to choose from, catering to individual preferences.
Pixit’s Two Cents: Google's introduction of Gemini Live is a direct shot at OpenAI, whose similar ChatGPT feature is only slowly starting to roll out. It brings us closer to truly natural and intuitive interactions with technology. The ability to engage in free-flowing conversations, complete with interruptions and the option to pause and resume discussions, is a game-changer that will reshape the way we interact with AI. I can’t wait to try it out!
Artists' Lawsuit Against Stability AI and Midjourney Gains Momentum: A judge has allowed additional copyright and trademark infringement claims to proceed against Stability AI, Midjourney, and other AI companies in a lawsuit filed by artists, who allege that their works were used to train AI models without consent or compensation.
Exists Launches GenAI Platform for Creating 3D Games from Text Prompts: Exists introduces its GenAI platform, which enables users to create 3D games using natural language text prompts.
Midjourney Releases New Unified AI Image Editor on the Web: Midjourney unveils a new web-based AI image editor that combines its various image generation and editing capabilities into a single, user-friendly interface, streamlining the creative process for users.
Luma Launches Dream Machine 1.5 with Enhanced Features: Luma releases Dream Machine 1.5, an update to its AI-powered image and video generation tool, introducing new features such as improved text-to-image capabilities, expanded aspect ratios, and enhanced user control over the creative process.
Stability AI Appoints New Chief Technology Officer: Stability AI, the company behind the popular Stable Diffusion model, names Hanno Basse as the new Chief Technology Officer to lead its technical strategy.
TurboEdit: Instant Text-Based Image Editing: Researchers introduce TurboEdit, a novel approach to text-based image editing that enables instant modifications to images using natural language instructions.
Former President Donald Trump Shares AI-Generated Images on Social Media: Former U.S. President Donald Trump shares controversial AI-generated images on social media, including a fictitious endorsement from Taylor Swift, sparking discussions about the potential misuse of AI in political contexts.
Procreate's Canvas Rebellion: Painting a Future Without AI: Procreate, a popular digital illustration app, takes a stand against generative AI by pledging to remain focused on human creativity and not incorporate AI-assisted features, highlighting the ongoing debate about the role of AI in the creative industries.
ElevenLabs' AI Reader App Now Supports 32 Languages Globally: ElevenLabs expands its AI-powered text-to-speech Reader app globally, adding support for 32 languages, including Portuguese, French, Mandarin, and Hindi.
Silicon Valley Opposes California's AI-Safety Bill: Tech giants and industry leaders in Silicon Valley are actively opposing a proposed AI-safety bill in California, which aims to regulate the development and deployment of artificial intelligence systems, citing concerns about potential limitations on innovation and competitiveness.
Andrew Ng Steps Back at Landing AI After Announcing New Fund: Andrew Ng, a prominent figure in the AI industry, announces his decision to step back from his role at Landing AI, the company he founded, shortly after unveiling a new fund focused on investing in artificial intelligence startups and initiatives.