What is Huddix and how does it work?

Huddix is an AI-powered meeting intelligence platform. Once installed, it automatically detects when you join a meeting on Zoom, Google Meet, or Microsoft Teams. It captures the audio, transcribes everything with speaker labels, generates AI summaries with action items, and stores searchable memories across all your meetings.

Which meeting platforms does Huddix support?

Huddix works with all major meeting platforms including Zoom, Google Meet, Microsoft Teams, Webex, and any app that uses your microphone. The desktop app auto-detects meetings — no browser extensions or bots needed.

Is Huddix free to use?

Yes! Huddix offers a free plan that includes local transcription and up to 5 meetings per month. Our Pro plan ($10/month) adds HD cloud transcription, AI summaries, semantic memory search, voice fingerprinting, and calendar integration with 30 hours per month.

How accurate is the transcription?

Huddix uses AssemblyAI for HD cloud transcription, achieving over 95% accuracy for clear audio. Our system includes automatic speaker diarization, so you always know who said what — even in meetings with many participants.

What is voice fingerprinting?

Voice fingerprinting uses neural network technology (ECAPA-TDNN) to create a unique voice profile for each speaker. This allows Huddix to automatically recognize and label speakers across different meetings — even when they switch devices or join from different accounts.

How does the cross-meeting memory work?

After each meeting, Huddix extracts key facts, decisions, and action items and stores them as semantic embeddings using pgvector. You can search across all your meetings using natural language — like asking "What did Sarah say about the Q4 budget?" and getting instant, accurate results.

Is my meeting data secure?

Absolutely. Huddix uses end-to-end encryption, row-level security in our database, and follows SOC 2 security practices. Your audio and transcripts are only accessible to you. You can export or delete all your data at any time.

What platforms is the desktop app available on?

The Huddix desktop app is available for macOS (Apple Silicon and Intel), Windows, and Linux. Download it from our download page and set it up in under a minute.

Back to Blog

Engineering Apr 5, 2026 7 min read

Building a Semantic Memory System with pgvector

How we use OpenAI embeddings and pgvector to build a cross-meeting memory graph that lets you search across all your conversations with natural language.

Huddix Team

One of the most powerful features of Huddix is the ability to search across all your meetings using natural language. "What did Sarah say about the Q4 roadmap?" or "When did we discuss the new pricing model?" — and you get instant, accurate answers. This post explains how we built this semantic memory system.

The Challenge

Traditional search relies on keyword matching. If you search for "Q4 roadmap" but the meeting transcript says "product plan for Q4," you might miss relevant results. Semantic search solves this by understanding the meaning behind your query and the content you're searching.

Our Architecture

1. Embedding Generation

For each meeting, we generate embeddings for:

The full transcript
The AI summary
Extracted key points and decisions
Action items

We use OpenAI's text-embedding-3-large model, which produces 3072-dimensional vectors with state-of-the-art performance on semantic similarity benchmarks.

2. Vector Storage with pgvector

We store embeddings in PostgreSQL using the pgvector extension. Each meeting's embeddings are stored alongside metadata (date, participants, topics) in a single table with a vector index for efficient similarity search.

3. Hybrid Search

When you search "What did Sarah say about Q4?", we:

Generate an embedding for your query
Find similar embeddings using vector similarity (cosine distance)
Filter by participants if you mention a name
Apply keyword boosting for exact matches
Re-rank results using our relevance model

4. Memory Graph

We go beyond simple vector search by building a memory graph. Entities (people, projects, companies) are extracted and linked across meetings. This enables multi-hop reasoning: "What projects did the engineering team discuss that were related to the Q4 launch?"

Performance

Our semantic search system handles 500K+ meetings with:

~50ms average query latency (p95)
94% relevance score (human eval)
99.9% uptime

Privacy and Security

All embeddings are encrypted at rest and in transit. User workspaces are isolated with row-level security — you can only search your own meetings. We never use your data to train models.

Future Directions

We're exploring temporal reasoning ( "how has the team's opinion on X evolved over time?"), multi-modal search (finding meetings based on slides or shared screens), and proactive memory suggestions ( "you might want to revisit this discussion from 3 months ago").

Engineering

How We Built Cross-Meeting Voice Fingerprinting

Using SpeechBrain's ECAPA-TDNN model, we can identify speakers across meetings with just seconds of ...