GGUF quantization of LLMs with llama.cpp

AI Bites
2,984 views · published 6 months ago (1403/01/03)
Would you like to run LLMs on your laptop, or on tiny devices like mobile phones and watches? If so, you will need to quantize them. llama.cpp is an open-source library written in C and C++ that lets us quantize a given model and run LLM inference without a GPU. In this video, I demonstrate how to quantize a fine-tuned LLM to the GGUF format on a MacBook and then run it on the same MacBook for inference (see the code sketch at the end of this description). I quantize the Gemma 2B-parameter model that we fine-tuned in my previous tutorial, but you can follow the same steps to quantize any other fine-tuned LLM of your choice.

MY KEY LINKS
YouTube: youtube.com/@AIBites
Twitter: twitter.com/ai_bites
Patreon: www.patreon.com/ai_bites
GitHub: github.com/ai-bites

WHO AM I?
I am a Machine Learning researcher/practitioner who has seen the grind of academia and start-ups. I started my career as a software engineer 15 years ago. Because of my love for mathematics (coupled with a glimmer of luck), I graduated with a Master's in Computer Vision and Robotics in 2016, just as the current AI revolution was getting started. Life has changed for the better ever since.

#machinelearning #deeplearning #aibites
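HOW THE QUANTIZATION STEP LOOKS IN CODE
This is not code from the video (the video uses llama.cpp's bundled converter script and CLI tools); it is a minimal C sketch of the same quantization step through llama.cpp's C API, assuming you have built llama.cpp and have llama.h on your include path. The GGUF file names are placeholders for whatever your fine-tuned export is called.

    #include <stdio.h>
    #include "llama.h"

    int main(void) {
        /* Start from the library's default quantization settings. */
        llama_model_quantize_params params = llama_model_quantize_default_params();
        params.ftype   = LLAMA_FTYPE_MOSTLY_Q4_K_M; /* 4-bit K-quant: a common size/quality trade-off */
        params.nthread = 4;                         /* number of CPU threads to quantize with */

        /* Input: an F16 GGUF exported from the fine-tuned checkpoint.
           Output: the 4-bit GGUF we run locally. Both names are placeholders. */
        uint32_t rc = llama_model_quantize("gemma-2b-f16.gguf",
                                           "gemma-2b-Q4_K_M.gguf",
                                           &params);
        if (rc != 0) {
            fprintf(stderr, "quantization failed (code %u)\n", rc);
            return 1;
        }
        printf("wrote gemma-2b-Q4_K_M.gguf\n");
        return 0;
    }

Compile it against your llama.cpp build (exact include and library paths depend on how you built it). llama.cpp's own quantize tool does the same thing from the command line, and the quantized GGUF can then be loaded for CPU inference on the MacBook as shown in the video.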