A Walkthrough of Nvidia’s Newest Multi-Modal LLM Household | by Mengliu Zhao | Oct, 2024

From LLaVA, Flamingo, to NVLM Multi-modal LLM growth has been advancing quick in recent times. Though…

From Set Transformer to Perceiver Sampler | by Mengliu Zhao | Oct, 2024

On multi-modal LLM Flamingo’s imaginative and prescient encoder Designing Multi-modal LLM is tough. The state-of-the-art multi-modal…

The Thriller Behind the PyTorch Computerized Blended Precision Library | by Mengliu Zhao | Sep, 2024

Information Format Fundamentals — Single Precision (FP32) vs Half Precision (FP16) Now, let’s take a better…

A Sensible Information to Contrastive Studying | by Mengliu Zhao | Jul, 2024

Now it’s time for some contrastive studying. To mitigate the problem of inadequate annotation labels and…