Background and Purpose
This SOP outlines best practices for creating and configuring a knowledge base to enable AI assistants to deliver accurate, domain-specific, and business-specific answers. The process ensures consistency and reliability in handling FAQs, pricing, objection handling, and other contextual information.Core Concepts
How Knowledge Bases Work
Context-Based Processing:- Knowledge is provided as context on a per-interaction basis when engaging with leads or contacts
- The system processes the contact’s message and compares it to information in your knowledge base
- AI generates specific responses based on contextual matching
- The AI does not inherently “know” the knowledge
- Instead, it uses information fed as context to assist the language model
- The LLM treats it as contextual input, not as a traditional database
Quality Principle
Critical Insight: The quality of the knowledge output is directly tied to the quality of the input. Poor, inaccurate, or unclear data in the knowledge base will lead to similarly poor responses.
Most Effective Information Types
Key-Value Pairs and Cause-Effect Relationships:- FAQs - Question and answer pairs
- Objection handling strategies - Common concerns and responses
- Domain-specific information - Industry knowledge
- Business-related questions - Company-specific details
Best Practices
Input Type Hierarchy
Preferred Input Types (Highest Quality)
- Raw Text Input - Manually crafted, accurate content
- FAQ Input - Structured question-answer pairs
Secondary Input Types (Use with Caution)
- Document Uploads - May contain outdated information
- Website Scrapes - Risk of conflicting or irrelevant data
Why Raw Text and FAQs Are Preferred
Advantages:- Higher reliability for enterprise deployments
- Better control over information quality
- Reduced risk of conflicting or outdated data
- Easier monitoring and management
- May introduce conflicting information
- Could contain outdated content
- Might include irrelevant details
- Harder to quality control
Content Sourcing Strategy
Recommended Sources for Raw Text
- Company website (current, accurate pages)
- Official documentation
- YouTube transcripts (verified accuracy)
- Community posts (fact-checked content)
- Internal training materials
Content Generation Process
- Source raw text from reliable materials
- Use AI tools (ChatGPT, Claude) to generate FAQs
- Tailor content for specific use cases:
- Sales-focused FAQs
- Support-oriented responses
- Success team materials
- Translate content for different languages
- Ensure objective input for objective output
Temperature Settings for Precision
For Businesses Requiring Precise Information
- Set AI temperature to 0-0.2
- Benefits:
- Ensures deterministic responses
- Reduces random variations
- Provides consistent answers
- Minimizes hallucination risk
Monitoring Knowledge Base Output
Accessing Transparency Logs
Step-by-Step Log Analysis
- Navigate to conversation logs in buildassistants.app
- Click the bracket icon "" to open transparency logs
- Look for two key log types:
- “Embedding” - Shows what’s being processed
- “Embed complete” - Shows knowledge base output
Analyzing Knowledge Base Performance
- Open “Embed complete” to see knowledge base output for contact’s message
- Review retrieved information for relevance and accuracy
- Identify gaps or incorrect information
- Note areas requiring improvement
Optimization Based on Logs
When Output Isn’t Ideal
- Return to knowledge base
- Create specific FAQ around the question asked
- Add more context for future similar questions
- Test with same query to verify improvement
Content Distribution Strategy
Proper Channel for Each Content Type
Prompt Should Contain:
- Personality (Identity) - Who the AI assistant is
- Response Guidelines - How to communicate
- Style Guardrails - Tone and approach
- Important Points - Key behavioral notes
- Instruction Set - What to do and when
Knowledge Base Should Contain:
- Domain-specific knowledge - Industry expertise
- Business-related information - Company details
- FAQs - Common questions and answers
- Objection handling - Response strategies
- Basic pricing details - Simple pricing information
- Key-value pair outputs - Structured responses
Tools Should Handle:
- Context injection - Dynamic information insertion
- Conditional instructions - Logic-based responses
- Data retrieval - Live information fetching
- Specific actions - Appointment booking, calculations
- Parameter scaling - Dynamic functionality
Complex Data Handling
When to Use Tools vs. Knowledge Base
Use Tools For:- Live data that changes frequently
- Complex pricing calculations
- Dynamic information requiring real-time updates
- Integration with third-party services
“AI is very smart, but not intuitive” - Complex operations often require structured tool calls rather than knowledge base storage.
Step-by-Step Implementation
1. Prepare the Knowledge Base Framework
Initial Setup
- Log into buildassistants.app
- Create a blank assistant with no pre-configured prompts or tools
- Set temperature to zero for deterministic and objective answers
Configuration Benefits
- Clean slate ensures no conflicting instructions
- Zero temperature provides consistent responses
- Focused testing on knowledge base alone
2. Gather Domain-Specific Content
Content Collection Strategy
- Collect source materials:
- Website content (current pages only)
- FAQ documents
- Internal guides and documentation
- Selective scraping to avoid inaccuracies
- Verify accuracy of all collected content
Quality Control
- Review all content for currency
- Remove outdated information
- Fact-check all claims and details
- Standardize formatting for consistency
3. Generate FAQs
AI-Assisted FAQ Creation
- Paste relevant content into AI tool (Claude or ChatGPT)
- Use specific prompt:
- Review and refine generated questions and answers
- Ensure accuracy of all FAQ pairs
FAQ Quality Standards
- Clear, specific questions
- Comprehensive, accurate answers
- Appropriate length for context
- Professional tone matching brand voice
4. Organize and Upload FAQs
Category Structure
Create clear categories for organization:- “FAQ Homepage” - General company information
- “FAQ Custom Tools” - Technical functionality
- “FAQ Pricing” - Cost and plan information
- “FAQ Support” - Help and troubleshooting
- “FAQ Sales” - Sales process and policies
Upload Process
- Use clear naming conventions for easy management
- Copy-paste each FAQ pair into appropriate category
- Maintain consistent formatting across all entries
- Double-check categorization for logical organization
5. Enhance with Elaborated Responses
Content Expansion
- Use AI tool to elaborate on existing FAQs
- Request specific focus:
- Upload expanded responses under corresponding categories
- Ensure expanded content maintains accuracy
Specialization Options
- Sales-focused elaborations
- Technical implementation details
- Support-oriented explanations
- Success team guidance
6. Refine and Validate Data
Comprehensive Audit Process
Check for Relevance:- Ensure answers align with current business model
- Remove outdated business practices
- Update changed policies or procedures
- Fact-check all technical details
- Validate pricing and plan information
- Confirm contact information and processes
- Replace misaligned entries as needed
- Delete completely outdated information
- Update partially correct entries
7. Test Knowledge Base Functionality
Systematic Testing Approach
-
Ask sample questions relevant to your business:
- “How much are voice minutes?”
- “What plans do you offer?”
- “How does billing work?”
- “What support do you provide?”
- Verify answers are drawn from knowledge base
- Check response accuracy against source material
- Test edge cases and unusual questions
Testing Validation
- Responses match uploaded content
- No hallucination or made-up information
- Appropriate context selection
- Professional tone maintained
8. Optimize Answers Through Refinement
Iterative Improvement Process
- Analyze feedback from testing and real usage
- Identify gaps in knowledge coverage
- Add specific details for common inquiries
Example Optimization:
Original: Basic plan information Enhanced: “What plans do you have?” → Add detailed pricing, plan differences, and feature comparisonsContinuous Refinement
- Monitor conversation logs regularly
- Update based on frequent questions
- Refine answers for clarity
- Add context where needed
9. Integrate with Tools for Advanced Features (Optional)
When to Use Tool Integration
Live Data Requirements:- Real-time pricing that changes frequently
- Inventory levels or availability
- Dynamic calculations based on user input
Integration Options
- Airtable for structured data
- Google Sheets for collaborative data management
- API endpoints for real-time information
- Custom tools for specific business logic
10. Deploy and Monitor
Production Configuration
-
Set response wait time:
- Zero seconds during testing phase
- 15-20 seconds for production (human-like interaction)
-
Monitor performance:
- Review logs regularly
- Assess embedding and context usage
- Track response quality metrics
Ongoing Maintenance
- Weekly log reviews
- Monthly knowledge base audits
- Quarterly comprehensive updates
- Annual strategy reviews
Definition of Done
The knowledge base optimization is complete when:- ✅ Knowledge base fully populated with accurate, domain-specific data
- ✅ AI consistently delivers objective and relevant answers
- ✅ Testing confirms robust and scalable outputs
- ✅ Logs validate answers are pulled correctly from knowledge base
- ✅ Temperature set to zero for deterministic responses
- ✅ Categories organized with clear naming conventions
- ✅ Content quality meets professional standards
Advanced Optimization Techniques
A/B Testing for Knowledge Content
- Test different answer formats for same questions
- Compare response effectiveness across variations
- Optimize based on user engagement
- Implement winning content versions
Performance Analytics
- Track knowledge retrieval success rates
- Monitor response relevance scores
- Analyze user satisfaction with answers
- Identify knowledge gaps through analytics
Multilingual Considerations
- Translate core FAQs to target languages
- Maintain consistency across language versions
- Test cultural appropriateness of responses
- Update translations when source content changes
FAQ
Why is temperature set to zero?
A: To ensure deterministic outputs, eliminating randomness in answers. This provides consistent, reliable responses every time the same question is asked.What types of content should go into the knowledge base?
A:- Pricing details and plan information
- FAQs and objection handling guides
- Sales and support scripts
- Domain-specific documentation
- Company policies and procedures
- Product feature explanations
How can I test knowledge base accuracy?
A:- Ask targeted questions directly related to uploaded content
- Check transparency logs to ensure responses are pulled from correct sources
- Compare AI answers with source material
- Test edge cases and variations of questions
What if my website data is outdated?
A:- Avoid direct scrapes of potentially outdated content
- Manually review and refine content before uploading
- Use current documentation as primary source
- Regular audits to maintain accuracy
Can I update the knowledge base later?
A: Yes, frequently audit and replace outdated information for sustained accuracy. Knowledge bases should be living documents that evolve with your business.How often should I update the knowledge base?
A:- Weekly reviews of conversation logs
- Monthly content audits
- Quarterly major updates
- Immediate updates for critical changes (pricing, policies)
What’s the difference between knowledge base and custom tools?
A: Knowledge bases store static information for context, while custom tools handle dynamic data, calculations, and real-time integrations. Use knowledge bases for FAQs and tools for live data.Best Practices Summary
Content Quality
- Prioritize accuracy over quantity
- Use reliable sources for all information
- Maintain consistency in tone and format
- Regular quality audits to ensure currency
Organization Strategy
- Clear categorization for easy management
- Logical naming conventions for quick retrieval
- Hierarchical structure for complex topics
- Cross-referencing for related topics
Performance Optimization
- Zero temperature for consistent responses
- Focused content relevant to user needs
- Regular monitoring through transparency logs
- Iterative improvement based on usage data