Google’s Video & Image Upgrades, OpenAI’s Search Boost, Meta’s Smart Glasses, & YouTube’s Creator Controls 🚀
Happy Tuesday, AI & Data Enthusiasts! Big moves in AI this week! 🚀 Google’s rolling out new tools for video and image creation, OpenAI is bringing ChatGPT search to everyone, Meta’s smart Ray-Ban glasses just got smarter with live translations, and YouTube is giving creators more say over AI training. Let’s check it out!
In today’s edition:
🎥 Google Unveils Veo 2, Imagen 3, and Whisk: Advancing AI Video and Image Creation
🔍 OpenAI Brings ChatGPT Search to All Users
👓 Meta Upgrades Ray-Ban Smart Glasses with Real-Time AI Video and Live Translation
🎬 YouTube Gives Creators More Control Over AI Training
- Naseema Perveen
WHAT CAUGHT OUR ATTENTION MOST
Google Unveils Veo 2, Imagen 3, and Whisk: Advancing AI Video and Image Creation
Google is pushing creative boundaries again with the release of Veo 2 and Imagen 3—cutting-edge tools for video and image generation. Plus, a brand-new experiment, Whisk, is here to make remixing your ideas faster and more fun. Let’s dive into how these advancements can bring your visuals to life.
Veo 2: The new video-generation model delivers stunning, high-quality videos up to 4K resolution with improved physics, human movement, and cinematographic control. It’s available now via VideoFX in Google Labs.
Imagen 3: This upgraded image-generation model creates sharper, more detailed images across diverse styles—photorealistic, abstract, anime, and more. Now rolling out globally in ImageFX.
Whisk: A playful new experiment that combines image input and remixing capabilities, powered by Imagen 3 and Gemini. Start with an image, refine the scene and style, and create something uniquely your own.
Google’s latest innovations—Veo 2, Imagen 3, and Whisk—are redefining how we create and remix visuals. Whether it’s high-quality videos, detailed images, or quick creative experiments, these tools signal a future where AI fuels limitless creativity.
AI JOURNAL TECH IMPACT AWARDS 2024
The Early Bird Deadline for the AI Journal Tech Impact Awards 2024 has been extended, giving you 7 extra days to apply and save $150!
With 25 categories—including Generative AI, Business Analytics, Cybersecurity, and more—these awards cover everything from the best use cases to standout teams, individuals, and companies. There’s no better time to claim your spot among the top AI experts.
You can view all previous AI Journal Tech Impact Award winners here.
Entering is more than just an opportunity; it’s a must for businesses looking to stand out, attract top talent, and earn the trust of customers and the wider market.
Submit your entry here.
WORTH WATCHING
Tune in for a compelling discussion about innovation, possibility, and the balance between technology and human ingenuity. Bill Gates explores the transformative power of Artificial Intelligence and its potential to reshape our future. From tackling global challenges to enhancing human productivity, Gates shares his insights on how AI can unlock opportunities and improve lives.
Watch now to see how Gates envisions AI shaping our world.
KEEP YOUR EYE ON IT
OpenAI Brings ChatGPT Search to All Users
OpenAI has rolled out its ChatGPT-powered search feature to all free users, aiming to disrupt Google’s search engine dominance. With real-time answers and cited sources, OpenAI positions ChatGPT as a cleaner, AI-driven alternative for finding information online.
Real-Time Information: ChatGPT can now pull live data from the web, overcoming previous data cutoffs and offering timely answers.
Cited Sources: Responses include links to relevant sources, improving transparency and accuracy.
Clean Interface: Unlike Google Search, ChatGPT’s search is ad-free, providing a clutter-free user experience.
Seamless Integration: The search feature is embedded within the ChatGPT platform and does not require a standalone product.
With its new search capabilities, OpenAI is redefining how users interact with online information. By combining AI precision with real-time web results, ChatGPT challenges traditional search engines like Google and sets the stage for a smarter, faster browsing experience.
Meta Upgrades Ray-Ban Smart Glasses with Real-Time AI Video and Live Translation
Meta has rolled out significant AI-powered upgrades to its Ray-Ban smart glasses, enhancing functionality with real-time video recognition, live translation, and conversational AI. These features position the glasses as a frontrunner in the growing AI-powered wearables market, taking on rivals like OpenAI and Google.
Real-Time AI Video: The glasses’ front-facing camera now integrates with Meta AI to analyze and answer questions about a user’s surroundings in real time. This feature was showcased during Meta’s Connect conference and is designed to make the AI assistant a seamless part of daily life.
Live AI Conversations: With the new “live AI” upgrade, users can hold ongoing, back-and-forth conversations with Meta AI. This hands-free experience makes interactions with AI feel smoother and more intuitive.
Live Translation: The glasses now support real-time speech translation between English and Spanish, French, or Italian. When someone speaks one of these languages, the glasses will translate their speech into English and play it through the open-ear speakers.
Music Recognition with Shazam Integration: Ray-Ban Meta glasses now come equipped with Shazam support. Users can simply say, “Hey, Meta, Shazam this song,” and the glasses will identify the track playing nearby, making it easy to discover new music while on the go.
Meta’s new updates to Ray-Ban smart glasses elevate their utility, making them more than just a tech accessory. With real-time AI video, live translations, and intuitive conversations, these glasses are setting a new standard for wearable AI. As Meta continues to refine and expand its features, it solidifies its place as a leader in the next generation of smart eyewear.
YouTube Gives Creators More Control Over AI Training
YouTube has announced a new feature that lets creators decide whether third-party AI companies can use their content to train AI models. The new option gives creators more control over their intellectual property and opens up potential avenues for compensation.
Creators can opt in: Through YouTube Studio, creators can select specific third-party companies (including AI giants like OpenAI, Meta, and Microsoft) to use their content for training AI models.
Default settings: By default, no third-party company can use a creator’s videos unless the creator explicitly opts in, ensuring creators keep control over their content.
Future possibilities: In the future, YouTube may allow authorized companies to directly download videos for training, offering creators more opportunities for compensation.
This new feature is a step toward transparency and control for creators, addressing previous concerns about AI companies using content without permission or compensation.
TERM OF THE DAY
Eigenvalue
An Eigenvalue in data science is a number that tells us how much a certain direction or feature in data "stretches" or "shrinks" during a transformation such as scaling. Imagine you push a rubber ball and, instead of changing shape everywhere, it only stretches along one line—how much it stretches along that line is the eigenvalue (the line itself is the eigenvector). For example, if a transformation stretches a direction by 3 times, the eigenvalue is 3. Eigenvalues power tools like PCA (Principal Component Analysis), which finds the most important directions in data.
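As a quick, hands-on illustration (a minimal sketch assuming NumPy is installed), the matrix below stretches one direction by 3 and shrinks another by 0.5, and np.linalg.eig recovers exactly those factors as eigenvalues:

```python
# Minimal eigenvalue sketch (assumes NumPy is installed).
import numpy as np

# A transformation that stretches the x-direction by 3 and shrinks y by 0.5.
A = np.array([[3.0, 0.0],
              [0.0, 0.5]])

eigenvalues, eigenvectors = np.linalg.eig(A)
print(eigenvalues)   # [3.  0.5] -> the stretch/shrink factor along each direction
print(eigenvectors)  # columns are the directions that only stretch or shrink
```

PCA applies the same idea to a dataset’s covariance matrix: the largest eigenvalues mark the directions where the data varies the most.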
Read more terms like this in our Glossary.
ICYMI
Reddit Tests a Conversational AI Search Tool.
Aurora Launches as X Unlocks Grok for Free Use.
U.S. Clears Export of Advanced AI Chips to UAE Amid Microsoft Partnership.
Nvidia Spotlights Accenture: A Key AI Player for 2025.
Meta Unveils New Llama AI Model and Future Plans.
AI TOOLS FOR LLM ENGINEERS
LangChain – Framework for building applications using LLMs with modular components.
Hugging Face Transformers – Open-source library for LLMs like GPT and BERT (see the quick sketch after this list).
OpenAI API – Access to powerful GPT models for text generation and embeddings.
LlamaIndex – Connect and index your private data for LLM-powered querying.
Pinecone – Vector database for managing and querying embeddings efficiently.
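To give a feel for how little code it takes to get started with one of these, here is a minimal text-generation sketch using Hugging Face Transformers (the model name "gpt2" is just an illustrative choice; any causal language model from the Hub works the same way):

```python
# Minimal text-generation sketch with Hugging Face Transformers.
# Assumes transformers and a backend like PyTorch are installed;
# "gpt2" is an illustrative model choice, not a recommendation.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
result = generator("AI newsletters are great because", max_new_tokens=20)
print(result[0]["generated_text"])
```

The other tools slot in around this core: LangChain and LlamaIndex orchestrate prompts and private data, while Pinecone stores and queries the embeddings such models produce.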
$$$ MONEY MATTERS
xAI raises $6B: Elon Musk's vision for AI dominance expands.
Cake raised $13 million to simplify open-source AI adoption in the enterprise.
AI medical company Cleerly raised $106 million in new funding.
OpenAI confirms a new $200 monthly subscription.
AI for meetings platform Fathom raised $17 million in new funding.
LINKS WE’RE LOVIN’
✅ Podcast: Dangers of AI and the End of Human Civilization | Lex Fridman Podcast #368.
✅ Cheat sheet: Large Language Model Cheat Sheet.
✅ Course: Generative AI with Large Language Models.
✅ Whitepaper: Role of AI & RPA In Banking & Insurance.
✅ Watch: JBL Wave Buds 2 & JBL Wave Beam 2.
A Quick Question before you go…
What data structure is used to handle recursive calls?
SHARE THE NEWSLETTER & GET REWARDS
Or copy & paste your referral link to others: https://aijournal.beehiiv.com/subscribe?ref=PLACEHOLDER
What do you think of the newsletter?
That’s all for now. Thanks for staying with us. If you have specific feedback, please let us know by leaving a comment or emailing us. We are here to serve you!
Join 130k+ AI and Data enthusiasts by subscribing to our LinkedIn page.
Become a sponsor of our next newsletter and connect with industry leaders and innovators.