SlideShare a Scribd company logo
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
Milvus: An Open-Source
Vector Database
Build cool stuff!
Learn about Deep
Learning
embeddings
Create a Vector
Database
Welcome!
Stefan Webb
Developer Advocate, Zilliz
stefan.webb@zilliz.com
https://www.linkedin.com/in/stefan-webb
https://x.com/stefan_webb
Tim Spann
Principal Developer
Advocate, Zilliz
tim.spann@zilliz.com
https://www.linkedin.com/in/timothyspann
https://x.com/PaaSDev
Jiang Chen
Head of Ecosystems and
Developer Relations, Zilliz
jiang.chen@zilliz.com
https://www.linkedin.com/in/jiangc1010
https://x.com/jiangc1010
Today’s Schedule
Morning Session (PT)
9.45 – 10.15
Intro to Milvus and
Vector Databases
10.15 – 10.35 Getting Started with Milvus
10.35 – 11.45
Contributing to Milvus; or,
Milvus Workshop
13.15 – 13.45
Intro to Milvus and
Vector Databases
13.45 – 14.05 Getting Started with Milvus
14.00 – 15.15
Contributing to Milvus; or,
Milvus Workshop
Afternoon Session (PT)
Searching the Web with Gen AI
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
VS.
Apple
VS.
Rising dough
VS.
Change car tire
Rising Dough
Proofing Bread
✔
❌
Why is Semantic Search Difficult?
Why is Semantic Search Important?
20%
Other
newly generated data in 2025
will be unstructured data
80%
Data Source: The Digitization of the World by IDC
Solution: Deep Learning
Similarity Search
New Challenge: Search in Vector Spaces
How to Index and
Search?
● High-dimensional
● > 1000 dims
How to Scale?
● 10-100 million vectors?
● Billions?
● Trillions?
● Billions of users?
Multiple Data Types?
● Text
● Images
● Audio
● Graphs
● …
| © Copyright 2024 Zilliz
13
Milvus is an Open-Source Vector Database to
store, index, manage, and use the massive
number of embedding vectors generated by
deep neural networks and LLMs.
contributors
400
stars
30K
docker pulls
66M
forks
2.7K
+
Milvus: The most widely-adopted vector database
Milvus Users
Integrations
Framework
Hardware
Infrastructure
Embedding Models LLMs
Software Infrastructure
Vector Database
Why Open-Source?
Cost-effective Innovation Community
Why Not Traditional Databases?
Suboptimal
Indexing / Search
Scaling Inadequate Query
& Analytics Support
All About Vector
Databases and Milvus
Semantic Similarity?
How Does Similarity Search Work?
Deployment Options
Milvus Lite
● Locally hosted
● Suitable for prototyping
and demos
Milvus Standalone
● Single remote/local server
● “Medium” scale
● Simplified setup,
maintenance, etc.
compared to cluster
Milvus Cluster
● Distributed system
● Many different types of
nodes
● Scales to 100s of billions
of vectors
Vector Database Architecture
Benchmarks
Shows 3-20x faster comparing with open
source Milvus
At least 6x faster than other vector databases
https://github.com/zilliztech/VectorDBBench
Demo Time: What Can
you Build with Milvus?
#1 - Multimodal Image Search
● https://multimodal-demo.milvus.io/
#2 - Chat with Open Source Software
● https://osschat.io/
Getting Started with
Milvus and Workshop
Getting Started
● Google Colab notebook
Choose Your Own Adventure
Contribute your first issue
Challenge: Mini-projects
Workshop: Semantic
Search with Milvus
Milvus Bootcamp Tutorials
Contributing Your First Issue
● A few first-time-contributor-suitable issues:
○ [Feature Request]: [Milvus]Collection load
○ [Feature Request]: [Milvus]Shards Number
[Feature Request]: [Milvus]Timeout
[Feature Request]: [Milvus]Partition
● Help add integrations of Milvus and new RAG/agent frameworks:
○ dynamiq-ai/dynamiq
○ CrewAI
○ EmbedAnything
○ Firebase GenKit
○ Cognee
https://zilliz.com/blog/contributing-to-open-source-milvus-beginners-guide
Contributing Your First Issue
You can also contribute to Milvus directly:
• Issues · milvus-io/milvus · GitHub
by following the contribution instructions, and see
• Contributing to Open Source Milvus: A Beginner’s Guide
Milvus Mini-Projects
Call for contribution of projects and demos built with Milvus. Get
inspired by examples here. We can feature them on Discord.
Suggestions:
• RAG with Contextual Retrieval
• Introducing Contextual Retrieval  Anthropic
• RAG for an application that requires structured output
• GitHub - dottxt-ai/outlines: Structured Text Generation
• Structured Outputs - OpenAI API
• RAG on Wikipedia
• wikimedia/wikipedia · Datasets at Hugging Face
Bootcamp Tutorials
https://milvus.io/bootcamp
Workshop: Building Semantic Search with Milvus
● Google Colab notebook
Generative AI Resource Hub
• Generative AI Resource Hub | Zilliz
Open Discussion Time
10/8/2024 San Francisco
10/15/2024 Silicon Valley
10/23/2024 New York
Held twice a month
We’re Hiring!
● Engineering Manager, Database Systems
● Sr SW Engineer, Distributed Systems
● SRE, Cloud Platform
● Staff SW Engineer, Cloud Platform
● Staff SW Engineer, Database Systems
https://zilliz.com/careers#open-positions
THANK
YOU
https://milvus.io/discord
https://github.com/milvus-io/milvus
https://x.com/milvusio
https://www.linkedin.com/company/the-
milvus-project
LET’S STAY
CONNECTED!

More Related Content

PDF
Continuous Delivery Without Breaking Everything
PDF
Multimodal Search with Open-Source Tools
PDF
LV Dev Efficiency NIDays 2015
PDF
8 Principles for Enabling Build/Measure/Learn: Lean Engineering in Action
PPTX
"Project Tye to Tie .NET Microservices", Oleg Karasik
PDF
How open source is driving DevOps innovation: CloudOpen NA 2015
PDF
Optimizing developer onboarding
PDF
Devops at SlideShare: Talk at Devopsdays Bangalore 2011
Continuous Delivery Without Breaking Everything
Multimodal Search with Open-Source Tools
LV Dev Efficiency NIDays 2015
8 Principles for Enabling Build/Measure/Learn: Lean Engineering in Action
"Project Tye to Tie .NET Microservices", Oleg Karasik
How open source is driving DevOps innovation: CloudOpen NA 2015
Optimizing developer onboarding
Devops at SlideShare: Talk at Devopsdays Bangalore 2011

Similar to 2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan (20)

PDF
DevOps and its impact
PPTX
Live coding a machine learning app
PPTX
Jfokus_Bringing the cloud back down to earth.pptx
PDF
JavaZone 2016 - The DevOps disaster
PDF
DevOps Utrecht - The DevOps Disaster
PPTX
FooConf23_Bringing the cloud back down to earth.pptx
PPT
Drupal Basics
PDF
Tbjsphx918
PDF
Bluemix and watson overview - Rencontres IBM et l'Ecole Polytechnique - 3 nov...
PDF
Run Your Java Code on Cloud Foundry - Andy Piper (Pivotal)
PDF
Run your Java apps on Cloud Foundry
PDF
Built to Scale: The Mozilla Release Engineering toolbox
PPTX
Genomics data insights
PPTX
Rakuten and Microsoft talk DevOps in Real World
PDF
Codemotion Amsterdam 2016 - The DevOps Disaster
PPTX
Community IT Webinar - Dropbox vs OneDrive
PDF
GraphConnect Europe 2016 - NoSQL Polyglot Persistence: Tools and Integrations...
PPTX
Why we fail at ml ai why we fail at ml_ai
PPTX
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
PDF
Continuos Integration and Delivery: from Zero to Hero with TeamCity, Docker a...
DevOps and its impact
Live coding a machine learning app
Jfokus_Bringing the cloud back down to earth.pptx
JavaZone 2016 - The DevOps disaster
DevOps Utrecht - The DevOps Disaster
FooConf23_Bringing the cloud back down to earth.pptx
Drupal Basics
Tbjsphx918
Bluemix and watson overview - Rencontres IBM et l'Ecole Polytechnique - 3 nov...
Run Your Java Code on Cloud Foundry - Andy Piper (Pivotal)
Run your Java apps on Cloud Foundry
Built to Scale: The Mozilla Release Engineering toolbox
Genomics data insights
Rakuten and Microsoft talk DevOps in Real World
Codemotion Amsterdam 2016 - The DevOps Disaster
Community IT Webinar - Dropbox vs OneDrive
GraphConnect Europe 2016 - NoSQL Polyglot Persistence: Tools and Integrations...
Why we fail at ml ai why we fail at ml_ai
JAXLondon 2015 "DevOps and the Cloud: All Hail the (Developer) King"
Continuos Integration and Delivery: from Zero to Hero with TeamCity, Docker a...
Ad

More from Timothy Spann (20)

PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
PDF
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
PDF
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
PDF
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
PDF
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
PPTX
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
PDF
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
Ad

Recently uploaded (20)

PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PDF
Global Data and Analytics Market Outlook Report
PPTX
New ISO 27001_2022 standard and the changes
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
modul_python (1).pptx for professional and student
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Introduction to Data Science and Data Analysis
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
A Complete Guide to Streamlining Business Processes
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Introduction to Inferential Statistics.pptx
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
Global Data and Analytics Market Outlook Report
New ISO 27001_2022 standard and the changes
Qualitative Qantitative and Mixed Methods.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
modul_python (1).pptx for professional and student
Pilar Kemerdekaan dan Identi Bangsa.pptx
ISS -ESG Data flows What is ESG and HowHow
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Topic 5 Presentation 5 Lesson 5 Corporate Fin
IBA_Chapter_11_Slides_Final_Accessible.pptx
Introduction to Data Science and Data Analysis
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
IMPACT OF LANDSLIDE.....................
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
A Complete Guide to Streamlining Business Processes
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Introduction to Inferential Statistics.pptx

2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan