July 4, 2024

This Week in AI

Hello, Niuralogists!

This week's edition spotlights the latest breakthroughs in artificial intelligence and examines their implications across workplaces, businesses, policy, and personal interactions. Featured updates include ElevenLabs' 'Iconic Voices', a camera inspired by the human eye, and more.

For a more in-depth understanding, keep on reading…

ElevenLabs Unveils ‘Iconic Voices’

ElevenLabs, an AI audio company, has introduced a new feature called ‘Iconic Voices’ for its recently launched Reader App, enabling users to have text read by AI-generated voices of famous Hollywood stars. The initial lineup includes AI-recreated voices of Judy Garland, James Dean, Burt Reynolds, and Sir Laurence Olivier, with more voices to be added in the coming months. ElevenLabs has secured licensing agreements for these voices with CMG Worldwide, which manages the estates of the featured late celebrities. Users can have the AI voices read books, articles, PDFs, and other text content within the Reader App, although the voices cannot be used to create shareable content on the platform. This development underscores the growing competition that voice actors face from AI-generated voice clones, while also setting a significant licensing precedent in the industry through direct collaboration with the estates of the late Hollywood stars.

Revolutionary Camera Inspired by the Human Eye

A team led by computer scientists at the University of Maryland has developed a groundbreaking camera mechanism that enhances how robots perceive and interact with their environment. Inspired by the human eye's ability to maintain clear and stable vision through tiny involuntary movements called microsaccades, the innovative camera system, known as the Artificial Microsaccade-Enhanced Event Camera (AMI-EV), mimics these movements to capture sharp, blur-free images even during motion. This advancement is detailed in a May 2024 paper published in Science Robotics. The AMI-EV uses a rotating prism to redirect light beams, simulating the natural movement of the human eye and stabilizing the textures of recorded objects. The team also created software to consolidate stable images from the shifting light. This technology promises to improve robotic vision, autonomous driving systems, and various applications requiring precise image capture, such as smart wearables and virtual reality. In tests, the AMI-EV successfully captured tens of thousands of frames per second, surpassing the capabilities of most commercial cameras, and demonstrating its potential in areas like augmented reality, security monitoring, and space imaging.

Runway Releases Gen-3 Alpha Access

Runway has announced that its AI video generator, Gen-3 Alpha, is now available to all users, following weeks of impressive viral outputs since it was unveiled in mid-June. The first model in Runway's next-gen series, Gen-3 Alpha is designed for learning 'general world models' and includes upgrades to key features such as character and scene consistency, camera motion and techniques, and scene transitions. The model is accessible through Runway's ‘Standard’ plan at $12 per month, which provides users with 63 seconds of generation time each month. Additionally, a free hands-on workshop will be held on Friday at AI University, covering how to create an AI commercial using Gen-3, ElevenLabs, and Midjourney. Despite impressive releases from competitors KLING and Luma Labs, Gen-3 Alpha represents a significant leap in AI video technology, although the limited generation time on non-unlimited plans may be a challenge for power users.

Google Unveils Gemma 2 and Gemini Upgrades

Google has launched Gemma 2, the latest addition to its open lightweight AI model series, along with new upgrades to its Gemini 1.5 Pro model. Gemma 2 is available in two sizes: a 9 billion parameter model and a larger 27 billion parameter model, with a 2.6 billion parameter lightweight version hinted for the future. The 27 billion parameter model rivals models more than twice its size, while the 9 billion parameter model significantly outperforms comparable models like Llama 3 8B. Additionally, Google has expanded access to Gemini 1.5 Pro’s 2 million token context window, enabling the processing of much longer inputs. The upgraded Gemini models also feature enhanced coding capabilities, allowing them to generate and run Python code, thereby improving accuracy in math and data reasoning tasks. These advancements position Google’s Gemma models as leading open-source AI tools that operate efficiently on a single GPU, while the expanded context window for Gemini 1.5 Pro unlocks a plethora of new capabilities for users.

AI Tools Alleviate Therapists' Burnout by Lightening Caseloads

A recent ZDNET feature explores AI's transformative impact on therapy as demand for care rises amid mental health challenges exacerbated by events like the COVID-19 pandemic. Teletherapy platforms like BetterHelp and Talkspace have emerged to fill accessibility gaps but have also introduced new complexities, such as increased workloads and gig-like work structures for therapists. To alleviate these pressures, therapists are increasingly turning to AI tools like Upheal, which streamline administrative tasks such as notetaking and documentation, allowing them to dedicate more time to direct patient care and reducing the risk of burnout. These tools use AI to analyze patient data, provide insights, and suggest treatment plans, while chatbots like Woebot and Wysa offer immediate mental health support and interactive exercises that complement traditional therapy. Therapists are also developing AI solutions tailored to their own practices, aiming to democratize mental health support while preserving the essential human touch in therapy.

Q&Ai

How does the US plan to address the global semiconductor shortage by closing the talent gap?

The global semiconductor industry faces a critical challenge: a widening skills gap that could hinder its growth and innovation. According to Deloitte, the industry will need more than a million skilled workers by 2030 to meet escalating global demand, a problem exacerbated by shortages in countries and regions such as Taiwan, South Korea, China, Japan, and Europe. The US, which accounts for only 12% of global chip production, aims to tackle this issue through the CHIPS and Science Act. This legislation allocates significant funds to develop the semiconductor workforce, focusing on technician roles and other critical positions. Despite these efforts, long-term challenges remain, including global competition for talent and limited career advancement opportunities. Addressing these complexities will require collaborative efforts to attract and retain the skilled workers essential to sustaining the semiconductor industry's leadership and innovation.

Is next-gen AI technology transforming daily work for employees?

Advancements in next-generation AI technology are reshaping everyday work, with the potential to significantly boost productivity and refine essential soft skills. Insights from a Harvard Business School study illustrate how generative AI accelerates task completion by 25%, improves job performance, and boosts overall job satisfaction. Innovators like 4149.AI, Arc53, and Lavender are leading the charge with AI solutions that streamline operations, personalize interactions, and foster better team collaboration. MongoDB's support for these startups, which build flexible data-driven applications on its database technology, underscores AI's potential to reshape modern work environments.

Tools

📈 June is AI-powered customer analytics for product-focused teams

📲 Pygma is a personal AI social media manager

🚀 AppFlowy is an open-source alternative to Notion that manages wikis and projects with AI

🖥️ VisualSitemaps autogenerates visual sitemaps for UX and SEO

⚖️ Created by Human is an AI rights licensing platform for creators

Follow us on Twitter and LinkedIn for more content on artificial intelligence, global payments, and compliance. Learn more about how Niural uses AI for global payments and team management to care for your company's most valuable resource: your people.

See you next week!
