Pinned: An Overview of My Blog Posts. Learning by Blogging. Jul 26, 2023
Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks. This blog was first published on the NVIDIA Tech Blog on May 8, 2025, coauthored with Dhruv Nandakumar, Agerneh Dagnew, and Nicola Sessions. May 11
Published in TDS Archive: The Journey of RAG Development: From Notebook to Microservices. Converting a Colab notebook to two microservices with support for Milvus and NeMo Guardrails. Feb 21, 2024
Published in TDS Archive: NeMo Guardrails, the Ultimate Open-Source LLM Security Toolkit. Exploring NeMo Guardrails’ practical use cases. Feb 9, 2024
Published in TDS Archive: 12 RAG Pain Points and Proposed Solutions. Solving the core challenges of Retrieval-Augmented Generation. Jan 30, 2024
Published in TDS Archive: Jump-start Your RAG Pipelines with Advanced Retrieval LlamaPacks and Benchmark with Lighthouz AI. Exploring robust RAG development with LlamaPacks, Lighthouz AI, and Llama Guard. Jan 29, 2024
Published in TDS Archive: Exploring mergekit for Model Merge, AutoEval for Model Evaluation, and DPO for Model Fine-tuning. My observations from experimenting with model merge, evaluation, and two model fine-tuning techniques. Jan 19, 2024
Published in TDS Archive: Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference. A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex. Jan 15, 2024
Published in TDS Archive: Deploying LLM Apps to AWS, the Open-Source Self-Service Way. A step-by-step guide on deploying LlamaIndex RAGs to AWS ECS Fargate. Jan 8, 2024
Published in TDS Archive: Safeguarding Your RAG Pipelines: A Step-by-Step Guide to Implementing Llama Guard with LlamaIndex. How to add Llama Guard to your RAG pipelines to moderate LLM inputs and outputs and combat prompt injection. Dec 27, 2023