Git Repo Research MCP Server
Model Context Protocol (MCP) server for researching Git repositories using semantic search
This MCP server enables developers to research external Git repositories and influence their code generation without having to clone repositories to local projects. It provides tools to index, search, and explore Git repositories using semantic search powered by Amazon Bedrock and FAISS.
Features
- Repository Indexing: Create searchable FAISS indexes from local or remote Git repositories
- Semantic Search: Query repository content using natural language and retrieve relevant code snippets
- Repository Summary: Get directory structures and identify key files like READMEs
- GitHub Repository Search: Find repositories in AWS-related organizations filtered by licenses and keywords
- File Access: Access repository files and directories with support for both text and binary content
Prerequisites
Installation Requirements
- Install
uv
from Astral or the GitHub README - Install Python 3.12 or newer using
uv python install 3.12
-
- uv - Fast Python package installer and resolver
- AWS credentials configured with Bedrock access
- Node.js (for UVX installation support)
AWS Requirements
- AWS CLI Configuration: You must have the AWS CLI configured with credentials that have access to Amazon Bedrock
- Amazon Bedrock Access: Ensure your AWS account has access to embedding models like Titan Embeddings
- Environment Variables: The server uses
AWS_REGION
andAWS_PROFILE
environment variables
Optional Requirements
- GitHub Token: Set
GITHUB_TOKEN
environment variable for higher rate limits when searching GitHub repositories
Installation
To add this MCP server to your Amazon Q or Claude, add the following to your MCP config file:
{
"mcpServers": {
"awslabs.git-repo-research-mcp-server": {
"command": "uvx",
"args": ["awslabs.git-repo-research-mcp-server@latest"],
"env": {
"AWS_PROFILE": "your-profile-name",
"AWS_REGION": "us-west-2",
"FASTMCP_LOG_LEVEL": "ERROR",
"GITHUB_TOKEN": "your-github-token"
},
"disabled": false,
"autoApprove": []
}
}
}
Tools
create_research_repository
Indexes a Git repository (local or remote) using FAISS and Amazon Bedrock embeddings.
create_research_repository(
repository_path: str,
output_path: Optional[str] = None,
embedding_model: str = "amazon.titan-embed-text-v2:0",
include_patterns: Optional[List[str]] = None,
exclude_patterns: Optional[List[str]] = None,
chunk_size: int = 1000,
chunk_overlap: int = 200
) -> Dict
search_research_repository
Performs semantic search within an indexed repository.
search_research_repository(
index_path: str,
query: str,
limit: int = 10,
threshold: float = 0.0
) -> Dict
search_research_repository_suggestions
Searches for GitHub repositories based on keywords, scoped to AWS organizations.
search_research_repository_suggestions(
keywords: List[str],
num_results: int = 5
) -> Dict
access_file
Accesses file or directory contents within repositories or on the filesystem.
access_file(
filepath: str
) -> Dict | ImageContent
delete_research_repository
Deletes an indexed repository.
delete_research_repository(
repository_name_or_path: str,
index_directory: Optional[str] = None
) -> Dict
Resources
repositories://{repository_name}/summary
Get a summary of an indexed repository including structure and helpful files.
repositories://awslabs_mcp/summary
repositories://
List all indexed repositories with detailed information.
repositories://
repositories://{index_directory}
List all indexed repositories from a specific index directory.
repositories:///path/to/custom/index/directory
Considerations
- Repository indexing requires Amazon Bedrock access and sufficient permissions
- Large repositories may take significant time to index
- Binary files (except images) are not supported for content viewing
- GitHub repository search is by default limited to AWS organizations: aws-samples, aws-solutions-library-samples, and awslabs (but can be configured to include other organizations)