Skip to content

Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and contents of any given code base

License

Notifications You must be signed in to change notification settings

shaneholloman/codemapper

Repository files navigation

CodeMapper

Total Downloads Pylint PyTest TODO

logo

Overview

Important

CodeMapper: Bridging Code Understanding for AI and Human Analysis

Install: uv tool install codemapper to your system. This will add a binary to your path. Zero conflicts with other tools. Of course you may use the traditional pip install codemapper as well.

CodeMapper is a powerful Python tool that transforms complex codebases into navigable, single-file Markdown artifacts, with a unique ability to bootstrap AI chat prompts for code analysis. Designed with both AI engineers and hobbyist developers in mind, it serves as a bridge between traditional code exploration and modern AI-assisted development workflows. Whether you're training large language models, conducting interactive AI-assisted code reviews, or simply trying to understand a new project, CodeMapper provides a unified, accessible view of your codebase that's optimized for both human readability and AI consumption.

Key Features

  • Unified Code Visualization: Automatically generates a comprehensive Markdown representation of your entire codebase, including:

    • Complete directory structure visualization
    • Syntax-highlighted code content
    • Intelligent file categorization
    • Documentation aggregation
  • AI Integration Optimizations:

    • Structured output format ideal for LLM training and analysis
    • Consistent formatting for improved token efficiency
    • Built-in support for common documentation patterns
    • Metadata preservation for enhanced context
  • Git-Aware Processing:

    • Respects .gitignore rules by default
    • Direct GitHub repository URL support
    • Handles large repositories efficiently
    • Smart binary file detection
    • We always shallow clone for speed!
  • Customization Options:

    • Exclude and Include dir options for fine-grained control
    • Always excludes .venv, .conda and node_modules directories
  • Documentation Focus:

    • Dedicated DocMap generation for documentation-heavy projects
    • README file prioritization
    • Support for multiple documentation directory conventions
    • Markdown-native output for universal compatibility

AI Chat Integration

CodeMapper excels at bootstrapping AI chat interactions for code analysis. When you need to understand, debug, or enhance your code with AI assistance:

  1. Generate a codemap of your project
  2. Copy the generated markdown into your AI chat
  3. Start asking detailed questions about your codebase

The AI assistant receives:

  • Complete project structure
  • All file contents with proper syntax highlighting
  • Documentation in context
  • Clear navigation structure

This enables the AI to provide more accurate, contextual responses about your code without manual file copying or context limitations.

Use Cases

For AI Engineers

  • AI Chat Prompt Bootstrapping: Instantly generate context-rich prompts for AI chat interactions about your code
  • Interactive Code Analysis: Seamlessly feed comprehensive codebase context to AI assistants
  • Training Data Preparation: Generate consistently formatted codebase representations for model training
  • Documentation Generation: Train models on well-structured documentation patterns
  • Code Understanding: Feed entire codebases to LLMs for comprehensive analysis

Tip

Example AI Chat Workflow:

# Generate a codemap for your project
codemapper /path/to/project_or_repo

# The generated codemap can be directly used in AI chat prompts:
"Here's my project structure and code, help me understand the dependency flow:
[paste or add to you project knowledge *_codemap.md content]"

For Hobbyists

  • Project Exploration: Quickly understand new codebases without complex IDE setup
  • Documentation Creation: Generate comprehensive project documentation automatically
  • Code Reviews: Facilitate easier code reviews with a unified view
  • Learning Tool: Study and understand how different projects are structured

Getting Started

# Install from PyPI
pip install codemapper

# Basic usage
codemapper /path/to/project

# Generate documentation map
codemapper --docs /path/to/project

# Process GitHub repository
codemapper https://github.com/username/repo

Example Output

CodeMapper generates two main types of outputs:

  1. CodeMap (project_codemap.md):

    • Complete directory tree
    • File contents with syntax highlighting
    • Smart handling of binary and large files
    • Navigation-optimized structure
  2. DocMap (project_docmap.md):

    • Documentation-focused view
    • README files
    • Documentation directory contents
    • Structured for easy consumption

Future Development

Note

CodeMapper welcomes help while actively focused on:

ToDos

  • Expanded format support (XML, JSON, YAML, RST)
  • Enhanced AI integration capabilities
  • Real-time code change tracking
  • Collaborative annotation features
  • Intelligent code pattern recognition
  • API access for programmatic integration

Target Audiences

AI Engineers Will Appreciate

  • Consistent, clean output format for training data
  • Efficient handling of large repositories
  • Structured metadata preservation
  • Integration-ready design

Hobbyists Will Value

  • Simple, straightforward installation
  • Clear, readable output
  • No complex configuration required
  • Immediate utility for project understanding

Installation Options

# For basic traditional installation
pip install codemapper

# Preferred installation
uv tool install codemapper

# For development installation
git clone https://github.com/shaneholloman/codemapper
cd codemapper
uv venv
uv sync
uv pip install -e .

Contributing

CodeMapper welcomes contributions from both AI engineers and hobbyist developers. The project maintains a balance between sophisticated features for AI integration and accessibility for general use.

Visit the GitHub repository to:

  • Report issues
  • Submit feature requests
  • Contribute code
  • Join the discussion

Resources

CodeMapper represents a bridge between traditional code exploration and modern AI-assisted development, making codebases more accessible and understandable for everyone from AI researchers to hobbyist developers.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Thanks to the pathspec and chardet libraries for enhancing CodeMapper's functionality.

Version History

For a detailed version history, please refer to the changelog.md.


If you find CodeMapper useful, don't forget to star this repository!

About

Perfect for AI Prompts. CodeMapper is a python script that creates a comprehensive Markdown document representing the structure and contents of any given code base

Topics

Resources

License

Stars

Watchers

Forks

Sponsor this project

 

Packages

No packages published