
Vector Database Architecture: TypeScript vs Go in 2025

An in-depth comparison of TypeScript and Go for vector database architecture, with benchmarks, cost analysis, and practical guidance for choosing the right tool.

Muneer Puthiya Purayil 10 min read

TypeScript and Go represent two pragmatic choices for building vector search services. TypeScript fits naturally when your AI application runs on Next.js or Node.js. Go fits when vector search is a backend service behind an API boundary. This comparison covers the real trade-offs for teams deciding between them — not for building a vector database engine (use Rust for that), but for the application and service layers that use one.

Performance Benchmarks

Benchmarked on AWS c6i.2xlarge (8 vCPU, 16GB RAM). Both languages calling into external vector databases (Qdrant) and embedding APIs (OpenAI).

API Serving Performance

| Metric | TypeScript (Next.js) | TypeScript (Hono/Bun) | Go (Chi) |
|---|---|---|---|
| Search endpoint RPS | 3,200 | 8,400 | 11,200 |
| p50 latency | 2.8 ms | 1.1 ms | 0.7 ms |
| p99 latency | 14 ms | 5.2 ms | 3.2 ms |
| Memory per instance | 180 MB | 95 MB | 85 MB |
| Cold start | 1.8 s | 0.4 s | 0.3 s |

Go is consistently faster for HTTP serving, but Bun-based TypeScript closes the gap significantly. Next.js is the slowest due to framework overhead, but its developer experience advantages (server components, streaming, caching) often outweigh raw throughput for user-facing applications.

Embedding Pipeline (10K documents via OpenAI API)

| Metric | TypeScript (async/await) | Go (goroutines) |
|---|---|---|
| Total time | 42 s | 44 s |
| Concurrent requests | 10 | 10 |
| Error handling | try/catch, typed | error returns |
| Code complexity | Low | Low |

Embedding generation is API-bound. Both languages handle concurrent HTTP requests well. TypeScript's Promise.all and Go's goroutines achieve similar throughput for I/O-bound work.
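The concurrency pattern behind those numbers can be sketched in a few lines. This is a minimal, illustrative TypeScript version that fans work out to a fixed number of workers (the benchmark used 10); `mapConcurrent` and `fn` are hypothetical names, with `fn` standing in for an embedding API call:

```typescript
// Sketch: run `fn` over all items with at most `limit` calls in flight.
// A shared cursor is safe here because index advancement is synchronous
// in JavaScript's single-threaded event loop.
async function mapConcurrent<T, R>(
  items: T[],
  limit: number,
  fn: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0;
  // Each worker pulls the next unprocessed index until the queue drains.
  async function worker(): Promise<void> {
    while (next < items.length) {
      const i = next++;
      results[i] = await fn(items[i]);
    }
  }
  await Promise.all(Array.from({ length: limit }, worker));
  return results;
}
```

The Go equivalent is a worker pool of goroutines reading from a channel; for I/O-bound work like this, both approaches saturate the API rate limit equally well.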

Vector Database Client Performance

| Operation | TypeScript (@qdrant/js-client-rest) | Go (qdrant-go) |
|---|---|---|
| Upsert 10K vectors | 1.2 s | 0.9 s |
| Search (top-10) | 8 ms | 5 ms |
| Batch search (100 queries) | 340 ms | 180 ms |
| Serialization overhead | ~2 ms/request | ~0.5 ms/request |

Go's advantage comes from less serialization overhead (no JSON.parse/stringify for gRPC) and more efficient concurrent request handling. For most applications, this difference is invisible behind network latency.
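The batch-search numbers on both sides come from overlapping round-trips rather than issuing them one after another. A minimal sketch, with `search` standing in for a vector database client call (hypothetical signature):

```typescript
// Sketch: issue all searches at once; total wall time approaches the
// slowest single round-trip instead of the sum of all of them.
async function searchMany<T>(
  vectors: number[][],
  search: (v: number[]) => Promise<T>,
): Promise<T[]> {
  return Promise.all(vectors.map((v) => search(v)));
}
```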

Code Comparison

Search Service

TypeScript (Next.js App Router):

typescript
// app/api/search/route.ts
import { NextRequest, NextResponse } from 'next/server';
import { QdrantClient } from '@qdrant/js-client-rest';
import OpenAI from 'openai';

const qdrant = new QdrantClient({ url: 'http://qdrant:6333' });
const openai = new OpenAI();

export async function POST(request: NextRequest) {
  const { query, tenantId, topK = 10 } = await request.json();

  // Generate embedding
  const embeddingResponse = await openai.embeddings.create({
    input: [query],
    model: 'text-embedding-3-small',
  });

  // Search
  const results = await qdrant.search(tenantId, {
    vector: embeddingResponse.data[0].embedding,
    limit: topK,
    with_payload: true,
  });

  return NextResponse.json({
    results: results.map((r) => ({
      id: r.id,
      score: r.score,
      text: r.payload?.text,
    })),
  });
}

Go (Chi Router):

go
package handlers

import (
	"encoding/json"
	"fmt"
	"net/http"

	"github.com/qdrant/go-client/qdrant"
	openai "github.com/sashabaranov/go-openai"
)

type SearchHandler struct {
	qdrant *qdrant.Client
	openai *openai.Client
}

type SearchRequest struct {
	Query    string `json:"query"`
	TenantID string `json:"tenantId"`
	TopK     int    `json:"topK"`
}

func (h *SearchHandler) Search(w http.ResponseWriter, r *http.Request) {
	var req SearchRequest
	if err := json.NewDecoder(r.Body).Decode(&req); err != nil {
		http.Error(w, "invalid request body", http.StatusBadRequest)
		return
	}

	if req.TopK == 0 {
		req.TopK = 10
	}

	// Generate embedding
	embResp, err := h.openai.CreateEmbeddings(
		r.Context(),
		openai.EmbeddingRequest{
			Input: []string{req.Query},
			Model: openai.SmallEmbedding3,
		},
	)
	if err != nil {
		http.Error(w, "embedding failed", http.StatusInternalServerError)
		return
	}

	// Search
	results, err := h.qdrant.Search(r.Context(), &qdrant.SearchRequest{
		CollectionName: req.TenantID,
		Vector:         embResp.Data[0].Embedding,
		Limit:          uint64(req.TopK),
		WithPayload:    true,
	})
	if err != nil {
		http.Error(w, "search failed", http.StatusInternalServerError)
		return
	}

	type Result struct {
		ID    string  `json:"id"`
		Score float32 `json:"score"`
		Text  string  `json:"text"`
	}

	output := make([]Result, len(results))
	for i, res := range results {
		output[i] = Result{
			ID:    fmt.Sprintf("%v", res.Id),
			Score: res.Score,
			Text:  res.Payload["text"].GetStringValue(),
		}
	}

	json.NewEncoder(w).Encode(map[string]any{"results": output})
}

Both implementations are roughly the same complexity. TypeScript is slightly more concise due to destructuring and the simpler error model.

Type Safety Comparison

TypeScript and Go both offer static typing, but with different ergonomics:

typescript
// TypeScript: structural typing, generics, union types
interface SearchResult<T = Record<string, unknown>> {
  id: string;
  score: number;
  payload: T;
}

type DocumentPayload = {
  text: string;
  docId: string;
  chunkIndex: number;
};

// The type system catches misuse at compile time
function processResults(results: SearchResult<DocumentPayload>[]) {
  return results.map((r) => ({
    excerpt: r.payload.text.slice(0, 200), // Type-checked
    source: r.payload.docId,
  }));
}
go
// Go: explicit types; generics exist since 1.18, but semi-structured
// payloads from the wire still arrive as map[string]interface{}
type SearchResult struct {
	ID      string                 `json:"id"`
	Score   float32                `json:"score"`
	Payload map[string]interface{} `json:"payload"`
}

type ProcessedResult struct {
	Excerpt string
	Source  string
}

// Requires runtime type assertions for dynamic payloads
func processResults(results []SearchResult) []ProcessedResult {
	out := make([]ProcessedResult, len(results))
	for i, r := range results {
		text, _ := r.Payload["text"].(string)
		docID, _ := r.Payload["docId"].(string)
		out[i] = ProcessedResult{
			Excerpt: text[:min(200, len(text))],
			Source:  docID,
		}
	}
	return out
}

TypeScript's structural typing and union types are more expressive for data transformation — common in vector search pipelines where payloads are semi-structured.
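To make that expressiveness concrete, here is a small illustrative sketch using a discriminated union over two hypothetical payload kinds (the names `Payload` and `describe` are assumptions, not part of any library). The compiler narrows each branch and rejects access to fields that don't exist on that variant:

```typescript
// Sketch: a discriminated union over payload kinds. The `kind` tag lets
// the compiler narrow the type inside each switch case.
type Payload =
  | { kind: "chunk"; text: string; docId: string }
  | { kind: "summary"; text: string; sourceIds: string[] };

function describe(p: Payload): string {
  switch (p.kind) {
    case "chunk":
      return `chunk of ${p.docId}`; // p.sourceIds would be a compile error here
    case "summary":
      return `summary of ${p.sourceIds.length} docs`;
  }
}
```

Modeling the same two shapes in Go typically means either two structs plus an interface, or one struct with optional fields and runtime checks.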

Ecosystem and Tooling

| Capability | TypeScript | Go |
|---|---|---|
| OpenAI SDK | Excellent (official) | Good (community) |
| Pinecone client | Official, well-maintained | Community |
| Qdrant client | Official REST | Official gRPC |
| Weaviate client | Official | Official |
| Streaming responses | Native (ReadableStream) | Native (io.Reader) |
| RAG frameworks | LangChain.js, Vercel AI SDK | None mature |
| Testing | Jest/Vitest, excellent DX | go test, good DX |

TypeScript has better AI/LLM SDK support. Go has better infrastructure and observability tooling. Choose based on which matters more for your use case.
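The streaming-responses row is worth making concrete, since streaming UX is a recurring reason teams pick TypeScript. A minimal sketch that emits search hits as newline-delimited JSON via the Web Streams API (a `ReadableStream` can be returned directly as a `Response` body in Next.js route handlers); the `results` array here stands in for an asynchronous result source:

```typescript
// Sketch: stream hits as NDJSON so the client can render them as they
// arrive instead of waiting for the full result set.
function streamResults(
  results: { id: string; score: number }[],
): ReadableStream<Uint8Array> {
  const encoder = new TextEncoder();
  return new ReadableStream({
    start(controller) {
      for (const r of results) {
        controller.enqueue(encoder.encode(JSON.stringify(r) + "\n"));
      }
      controller.close();
    },
  });
}
```

The Go counterpart writes to the `http.ResponseWriter` and calls `Flush` between records; both are native, but the TypeScript version composes directly with React streaming primitives.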

When to Choose TypeScript

  • Full-stack AI apps: When search is part of a Next.js application with React frontend
  • Rapid prototyping: TypeScript's ecosystem lets you go from idea to demo faster
  • Frontend-heavy teams: If your engineers primarily write TypeScript/JavaScript
  • Streaming UX: Server-sent events and streaming responses are natural in Next.js
  • Small scale: Under 5K QPS, TypeScript's throughput is sufficient

When to Choose Go

  • Backend microservices: When search is an internal service called by other backends
  • High throughput: Above 5K QPS, Go's efficiency matters for cost
  • Infrastructure teams: If your platform is Go-based (Kubernetes controllers, operators)
  • Low memory budget: Go uses 2-3x less memory per instance
  • Multiple clients: When the search API serves mobile, web, and CLI clients

