Frequently Asked Questions (FAQs)
1. How should we manage repositories for Chatbot?
Organize repositories based on modules:
Chatbot Backend
NLP Engine (Gen AI)
Terraform scripts for Deployments
Knowledge Content
2. How should we scale our infrastructure?
Evaluate scaling strategies based on demand and growth patterns.
Consider horizontal scaling for adding more nodes or vertical scaling for enhancing existing nodes.
3. Should we use Network Load Balancer or Application Load Balancer?
Decide based on specific requirements:
Network Load Balancer (NLB) for even traffic distribution.
Application Load Balancer (ALB) for advanced routing, SSL termination, and content-based routing.
4. How should we scale Vector Database Infrastructure?
Assess growth patterns and scaling needs.
Implement horizontal scaling by adding more nodes or vertical scaling by enhancing existing nodes.
5. Should we use a common GenAI module for all Chatbots?
Evaluate the benefits, considering performance, customization, and maintenance implications.
6. How to manage infrastructure?
Implement version control for infrastructure-as-code scripts.
Ensure proper documentation for onboarding and troubleshooting.
7. How to monitor the health of infrastructure?
Use tools like AWS CloudWatch to monitor health.
Set up alarms for critical metrics, e.g., EC2 instances utilization.
8. How to monitor our APIs?
Employ API monitoring tools to track response times, error rates, and overall health.
9. How to identify performance bottlenecks?
Conduct regular performance testing.
Utilize profiling tools and metrics to pinpoint areas of improvement.