Data Machina #238 – Data Machina


Non-stop AI Innovation Every Single Week. Well yeah, thats’s right: There is no single week without something new, exciting, or amazing happening in AI. This is a selection of interesting, cool stuff that happened in the last 7 days or so:

OpenAI introduced new, faster, and more efficient embedding models. Buried in the blog announcement, it says: “the new embedding models were trained with a technique that allows developers to shorten embeddings without the embedding losing its concept-representing properties.” Well – for some reason- it seems the blog fails to mention that the technique is called Matryoshka Representation Learning (paper, repo), an encoding method for embedding proposed by Google Research in 2022.

Voyage AI announced voyage-code-2, a new embedding model specifically optimised for code-related applications, including semantic code search/retrieval, code completion, and various functions of general code assistants. The model was evaluated on 11 code retrieval tasks, and performs well against OpenAI’s and Cohere’s models.

The team at 01.ai open-sourced Yi Vision Language (Yi-VL), a new, top performant, multimodal model for enabling content comprehension, recognition, and multi-round conversations about images. Yi-VL is based on the LlaVA vision instruction-tuning architecture, and as a few days ago, it was raking first in all the benchmarks for those kind of open source models.

MS Research published a new interesting paper: RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study. This is a must-read for those learning on or building LLM apps that involve RAG or fine-tuning. Which method is better? In which cases? In the paper, the researchers propose a pipeline for fine-tuning and RAG tailored for a specific industry domain, and then they present the pros & cons, and tradeoffs of both methods for multiple popular LLMs, including Llama2-13B, GPT-3.5, and GPT-4. Lots of insights, a great read!

The search for new alternatives to the Transformer & Attention Mechanism, and the Mamba Paper published in December have literally triggered a storm of Mamba-based derivative models, papers and memes too. The new idea is to integrate structured state space models (SSMs) into a simplified neural net without attention or even MLP blocks. Interested in SSMs? Checkout this free, interactive book: State Space Models: A Modern Approach with accompanying Python code. And here are the latest papers on Mamba-based models published in January:

Google Research announced Lumiere, a A Space-Time Diffusion Model for Video Generation (paper, demos.) The model produces truly amazing results in: text-to-video, image-to-video, stylised generation, video stylisation, cinemagraphs, and in painting. The innovation here is the introduction of a new Space-Time U-Net architecture that generates the entire temporal duration of the video at once, through a single pass in the model. Checkout the demo video below:

Have a nice week.

  1. The Future of ML: Auto ML + AI Agents

  2. The Big Picture of AI Research

  3. Are We at Peak Vector DB?

  4. Exploring ColBERT with RAGatouille

  5. Recursive Embedding & Clustering at Spotify

  6. AutoML for Content Abuse Detection at LinkedIn

  7. What I Learned Building the ML Platform at Mailchimp

  8. AI that Quacks: A New Txt-2-SQL Model in DuckDB

  9. A Curated List of Free ML/ DL Youtube Courses

  10. a16z VC : Prosumer 2.0 & Rise of AI Native Workflows

Share Data Machina with your friends

  1. Implement a Sparse Mixture of Experts Model from Scratch

  2. How to Merge Several LLMs into One Model, Jan 2024

  3. How to Build Games with Crew AI Agents

  1. CMU Spring 2024 Neural Code Generation (slides, papers)

  2. Deep RL + Raspberry Pi Zero for WiFi Pawning

  3. MS Research: The Autoregressive/ Non- Autoregressive Battle in Speech Synthesis

  1. InstantID: Zero-shot Identity-Preserving Generation in Seconds

  2. Dense X Retrieval – Propositions as a Novel Retrieval Unit

  3. DeepMind GATS: A New Approach to Combine Pretrained Foundation Models

  1. CI/CD for ML in 2024: Best Practices

  2. Lessons Learnt Operationalising LLM Apps

  3. The Ultimate Guide to ML Model Deployment in 2024

  1. Visual Analysis of Family Spending in the EU

  2. Top 23 Spatial Analysis & Visualisations in 2023 with Carto

  3. [interactive] Visualising 2023 Netflix Engagement Report

  1. FuseMachines – Enterprise AI Platform

  2. OpenDialog – Conversational AI for Regulated Industries

  3. BlueSheets – AI for Accounting Automation

  1. WebDataset – 12M Img-Txt Pairs for Vision-Language

  2. InstructDoc Dataset for Zero-Shot Visual Doc Understanding

  3. WildRGB-D: 20K RGB-D Vids for Real-World 3D Object Learning

Enjoyed this post? Tell your friends about Data Machina. Thanks for reading.

Share

Tips? Suggestions? Feedback? email Carlos

Curated by @ds_ldn in the middle of the night.



Tags:

We will be happy to hear your thoughts

Leave a reply

0
Your Cart is empty!

It looks like you haven't added any items to your cart yet.

Browse Products
Powered by Caddy