Defra AI in the SDLC Playbook

The set of Cursor rules files for Python backend applications.

Cursor stores rules files in the repository's .cursor/rules directory. Each file below is a separate .mdc file in that directory.

project.mdc

---
description: Project Overview
globs: *.*
---
# Project Overview

## Stack
- Python
- FastAPI
- Anthropic
- MongoDB

## Directory Structure
```
src/
├── api/v1/      # Routing and HTTP endpoints
├── services/    # Business logic
├── repositories/# Data access
├── agents/      # AI agents
├── utils/       # Utility functions
├── config/      # Configuration
├── database/    # Database operations
└── models/      # Data models
```

## Key Principles
- Security-first approach
- Clean code practices
- Async by default

## Git Commits
- Use conventional commit prefixes (feat:, fix:, etc.)
- Keep messages concise and reference issues

## Security
- No secrets in code
- Validate inputs
- Encrypt sensitive data
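
To illustrate the first two security rules, here is a minimal sketch that reads a secret from the environment and validates an input value. The helper names `get_required_env` and `validate_identifier`, and the identifier format, are assumptions made for this example and are not part of the project.

```python
import os
import re

# Assumed rule for this sketch: short alphanumeric identifiers only.
_IDENTIFIER_PATTERN = re.compile(r"[A-Za-z0-9_-]{1,64}")


def get_required_env(name: str) -> str:
    """Read a secret from the environment instead of hard-coding it."""
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"Missing environment variable: {name}")
    return value


def validate_identifier(value: str) -> str:
    """Reject input that does not match the expected identifier format."""
    if not _IDENTIFIER_PATTERN.fullmatch(value):
        raise ValueError("Invalid identifier")
    return value
```

Failing fast on a missing variable keeps connection strings out of the codebase and surfaces configuration problems at startup rather than at first use.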

code_style.mdc

---
description: Code Style for FastAPI Projects
globs: *.py
---
# Code Style Guide

## Base Standards
- Follow PEP 8 conventions
- Use Pyright for type checking
- Comply with Pylint rules

### Formatting
- Indent: 4 spaces
- Line Length: 79 chars
- Docstring: Google style


## Naming Conventions
- Use `snake_case` for functions, variables, and modules
- Use `PascalCase` for classes
- Use `UPPER_CASE` for constants

## Import Order
1. stdlib
2. third_party
3. local

## Function Guidelines
- Maximum function length: 50 lines
- Maximum nesting depth: 3 levels
- Single responsibility principle
- Clear return types 
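
As an illustration of the conventions above, here is a small sketch of a module that follows them. The names `MAX_RETRIES`, `RetryPolicy` and `calculate_backoff_seconds` are hypothetical and exist only for this example.

```python
"""Hypothetical retry helpers that illustrate the style rules."""

# stdlib imports come first; third-party and local imports would
# follow in their own groups.
from dataclasses import dataclass

# Constants use UPPER_CASE.
MAX_RETRIES = 3


@dataclass(frozen=True)
class RetryPolicy:
    """Retry settings. Classes use PascalCase.

    Attributes:
        base_delay_seconds: Delay before the first retry.
        max_retries: Maximum number of retries allowed.
    """

    base_delay_seconds: float = 0.5
    max_retries: int = MAX_RETRIES


def calculate_backoff_seconds(policy: RetryPolicy, attempt: int) -> float:
    """Return the exponential backoff delay for a retry attempt.

    Args:
        policy: Retry settings to apply.
        attempt: Zero-based retry attempt number.

    Returns:
        The delay in seconds before the given attempt.

    Raises:
        ValueError: If attempt is outside the allowed range.
    """
    if not 0 <= attempt < policy.max_retries:
        raise ValueError("attempt is outside the allowed range")
    return policy.base_delay_seconds * (2 ** attempt)
```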

testing.mdc

---
description: Testing Standards for FastAPI projects
globs: test_*.py
---
# Testing Standards

## Core Principles
- Test behaviour, not implementation.
- Prefer integration tests that exercise multiple units together over isolated unit tests.
- Mock external dependencies only, at the lowest level (e.g. database operations, API calls).
- Prioritise clarity and readability.
- Follow the Given-When-Then pattern with inline comments.

## Test Cases
- **Happy Path**: Ensure that valid data produces expected responses and database state.
- **Error Cases**: Cover invalid input (400), resource not found (404), server errors (500) and dependency failures (e.g. database failures).

## Best Practices
- **Organisation**: Group tests by endpoint or operation with clear comment headers (e.g. "# Test Cases - Create").
- **Naming**: Use descriptive test names (e.g. `test_operation_scenario_expected`).
- **Fixtures**: Use fixtures for common test data and mocks.
- **Assertions**: Validate status codes first, then response bodies.
- **Async Testing**: Use AsyncClient for async endpoints and AsyncMock for async operations.
- **Look for existing test patterns first**:
  - Check similar test files for patterns
  - Review conftest.py for established mocking approaches
  - Reuse existing test data from test_data.py

## Test File Structure
Each test file should follow this structure:
1. File docstring explaining purpose
2. Imports (stdlib, third-party, local)
3. Setup fixtures (autouse and specific)
4. Test cases grouped by operation
5. Helper functions (if needed)

## Database Mocking Approach
If the test requires MongoDB, refer to [mongodb_mocking.mdc](mdc:.cursor/rules/mongodb_mocking.mdc).

## Background Process Mocking
When testing endpoints that spawn background processes:

1. Always mock at the service level where the Process is instantiated
```python
# CORRECT - Mock where Process is created
with patch('app.services.standard_set_service.Process'):
    response = await async_client.post(...)

# INCORRECT - Mock at too high a level
with patch('multiprocessing.Process'):
    response = await async_client.post(...)
```

2. Common pitfalls to avoid:
   - Don't mock at the global multiprocessing level
   - Don't let test processes persist after test completion
   - Remember to mock in all test cases that trigger the service
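
The reason for patching at the service level can be shown with a self-contained sketch. It assumes the service binds the name with `from multiprocessing import Process`; the module `example_service` and its `start_job` function are hypothetical stand-ins built in memory so the example runs on its own.

```python
import sys
import types
from unittest.mock import patch

# Hypothetical stand-in for a service module that binds the name at
# import time with `from multiprocessing import Process`.
service = types.ModuleType("example_service")
exec(
    "from multiprocessing import Process\n"
    "\n"
    "def start_job():\n"
    "    worker = Process(target=print)\n"
    "    worker.start()\n"
    "    return worker\n",
    service.__dict__,
)
sys.modules["example_service"] = service

# Patching the name inside the service module replaces what the
# service actually calls, so no real process is started.
with patch("example_service.Process") as mock_process:
    worker = service.start_job()
    patched_at_service_level = worker is mock_process.return_value
    mock_process.assert_called_once_with(target=print)
    worker.start.assert_called_once_with()

# Patching the multiprocessing module leaves the service's own
# reference untouched, so the service would still use the real class.
with patch("multiprocessing.Process") as global_mock:
    service_still_uses_real_class = service.Process is not global_mock
```

If the service instead calls `multiprocessing.Process(...)` through the module, the lookup happens at call time and the patch target would differ, so check how the service imports the name before choosing the target.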

## Test Data Management
1. ALL test data should be in test_data.py
2. Use `create_db_document` for generating test documents
3. Include all required fields, timestamps, and ObjectId
4. Provide helper functions for each document type (e.g. `create_classification_test_data`)
5. Use fixtures for input data validation (e.g. `valid_classification_data`)
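
A minimal sketch of what such helpers might look like follows. The signatures and the `created_at`, `updated_at` and `name` fields are assumptions for illustration, and a 24-character hex string stands in for `bson.ObjectId()` so the example runs without extra dependencies.

```python
import secrets
from datetime import datetime, timezone
from typing import Any


def create_db_document(**overrides: Any) -> dict[str, Any]:
    """Build a test document with an id and timestamps (assumed shape)."""
    now = datetime.now(timezone.utc)
    document: dict[str, Any] = {
        # Stand-in for bson.ObjectId(): a 24-character hex string.
        "_id": secrets.token_hex(12),
        "created_at": now,
        "updated_at": now,
    }
    # Caller-supplied fields override or extend the defaults.
    document.update(overrides)
    return document


def create_classification_test_data(name: str = "Example") -> dict[str, Any]:
    """Return a classification document; the `name` field is assumed."""
    return create_db_document(name=name)
```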

## Test Structure Pattern
Each test should follow this structure:
```python
async def test_operation_scenario_expected(
    async_client,
    mock_database_setup,
    mock_collections,
    test_data_fixtures
):
    # Given: Setup context with clear comments
    collection = mock_collections
    repo = Repository(collection)
    service = Service(mock_database_setup, repo)
    app.dependency_overrides[get_service] = lambda: service
    
    # When: Action is performed
    response = await async_client...
    
    # Then: Verify results
    assert response.status_code == status.HTTP_XXX_XXX
    data = response.json()
    assert ... # Additional assertions
```

## Tools & Frameworks
- **pytest** with **pytest-asyncio** for async tests.
- **pytest-cov** for coverage (minimum 90%).
- **httpx** AsyncClient for async HTTP testing.
- FastAPI's **TestClient** for synchronous tests.

## FastAPI Testing Guidelines
- Configure TestClient/AsyncClient with the correct `base_url` and manage lifespan events.
- Override dependency providers (not the implementations) and reset overrides between tests.
- Validate responses against both status codes and Pydantic models.
- Use proper async/await patterns throughout tests.

## Test Directory Structure
```
tests/
├── conftest.py   # Fixtures and configuration
├── integration/  # Integration tests
├── unit/         # Unit tests
└── utils/        # Utility functions and test data
```

mongodb_mocking.mdc

---
description: MongoDB Database Mocking
globs: *.py
---
# Best Practices for Mocking MongoDB in Tests

This document outlines the recommended rules and practices for mocking MongoDB interactions in our codebase. It builds on our current approach and includes additional suggestions for improved test isolation, clarity, and reliability.

First, review the existing test patterns to check whether MongoDB is already being mocked, rather than repeating the implementation.

## 1. **General Principles**

- **Isolate Your Tests:**  
  Always use mocks to replace real MongoDB connections, ensuring tests run in isolation and without side effects.
  
- **Use Dependency Injection:**  
  Inject mocked repositories and services into your application (e.g., via FastAPI dependency overrides) to avoid accidental use of production code.

- **Consistent Naming Conventions:**  
  Name fixtures and mocks clearly (e.g., `mock_database_setup`, `setup_list_cursor`) to make it easy to trace their usage.

## 2. **Global Database Patching**

- **Patch Key Entry Points:**  
  - Patch `init_database` to return a mocked database.
  - Replace the actual MongoDB client (`motor.motor_asyncio.AsyncIOMotorClient`) with a `MagicMock`.
  - Patch helper functions like `get_database` to ensure all database interactions use the mock.

- **Ensure Complete Isolation:**  
  Confirm that any code initializing a connection to MongoDB uses the patched versions so no live database is ever contacted during tests.

## 3. **Collection and Cursor Mocks**

- **Asynchronous Collections:**  
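  Use `AsyncMock` for collection methods that are awaited (e.g., `find_one`, `insert_one`). `find` itself is synchronous in Motor and returns a cursor, so mock it with `MagicMock` and make the cursor's `to_list` an `AsyncMock`.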

- **Cursor Chaining:**  
  When mocking cursor chains:
  - Make sure that methods such as `sort()` return the same cursor mock.
  - Example:
    ```python
    mock_cursor.sort = MagicMock(return_value=mock_cursor)
    ```

- **Property Patching:**  
  Use `PropertyMock` to simulate attribute access (e.g., linking a collection's `.database` attribute to the mocked database):
  ```python
  type(collection).database = PropertyMock(return_value=mock_db)
  ```

- **Dynamic Collection Resolution:**  
  If you have multiple collections, consider using a dictionary with a side_effect to route collection names correctly:
  ```python
  mock_db.get_collection = MagicMock(side_effect=lambda name: {
      "standards": standards_collection,
      "classifications": classifications_collection,
      # add other collections as needed
  }[name])
  ```
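
Putting the cursor rules together, here is a minimal sketch of a full chain. It uses only `unittest.mock` and `asyncio`; the document contents and the `list_documents` function are made up for the example.

```python
import asyncio
from unittest.mock import AsyncMock, MagicMock

documents = [{"_id": "1", "name": "alpha"}, {"_id": "2", "name": "beta"}]

# The cursor is a plain MagicMock because the chaining methods are
# synchronous; only to_list is awaited.
mock_cursor = MagicMock()
for method_name in ("sort", "skip", "limit"):
    getattr(mock_cursor, method_name).return_value = mock_cursor
mock_cursor.to_list = AsyncMock(return_value=documents)

# find() is synchronous in Motor and returns the cursor.
collection = MagicMock()
collection.find = MagicMock(return_value=mock_cursor)


async def list_documents() -> list:
    """Run a typical find/sort/skip/limit chain against the mock."""
    cursor = collection.find({}).sort("name", 1).skip(0).limit(10)
    return await cursor.to_list(length=10)


result = asyncio.run(list_documents())
mock_cursor.sort.assert_called_once_with("name", 1)
```

Returning the same cursor mock from every chaining method means the final `to_list` is always the one configured above, whatever order the code under test chains the calls in.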

## 4. **Operation Mocks**

- **Find and Cursor Operations:**
  - Mock the `find` method to return a cursor whose `to_list` method returns your predefined list of documents.
  - Use a helper fixture like `setup_list_cursor` to standardize this behavior.

- **CRUD Operations:**
  - Insert: Mock `insert_one` to return an object with an `inserted_id` (consider using a helper fixture like `setup_mock_result`).
  - Update & Delete: Ensure `update_one` and `delete_one` return objects with `modified_count`, `matched_count`, or `deleted_count`.

- **Error Simulation:**
  - Leverage `side_effect` to simulate exceptions (e.g., `side_effect=Exception('Database error')`).
  - Always test both success and error scenarios.

- **Explicit Assertions:**
  - Use assertions such as `assert_called_once_with` to verify that mocked methods are called with the expected arguments.
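
The operations above can be sketched as follows. The `setup_mock_result` helper here is written as a plain function with an assumed signature; the project's own fixture of that name may differ.

```python
import asyncio
from types import SimpleNamespace
from unittest.mock import AsyncMock


def setup_mock_result(**fields):
    """Hypothetical helper: a result object with the given attributes."""
    return SimpleNamespace(**fields)


# Child attributes of an AsyncMock are AsyncMocks, so each is awaitable.
collection = AsyncMock()
collection.insert_one.return_value = setup_mock_result(inserted_id="abc123")
collection.update_one.return_value = setup_mock_result(
    matched_count=1, modified_count=1
)
collection.delete_one.side_effect = Exception("Database error")


async def exercise_collection():
    """Call each mocked operation and capture the simulated failure."""
    inserted = await collection.insert_one({"name": "alpha"})
    updated = await collection.update_one(
        {"_id": inserted.inserted_id}, {"$set": {"name": "beta"}}
    )
    try:
        await collection.delete_one({"_id": inserted.inserted_id})
    except Exception as error:
        failure = str(error)
    else:
        failure = None
    return inserted, updated, failure


inserted, updated, failure = asyncio.run(exercise_collection())

# Verify the mock was awaited with the expected arguments.
collection.insert_one.assert_awaited_once_with({"name": "alpha"})
```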

## 5. **Service and Dependency Injection**

- **Mocking Services:**
  - Wrap collection mocks in repository classes and inject them into services.

- **Dependency Overrides:**
  - In FastAPI or similar frameworks, override dependencies to ensure the application uses mocks during tests:
  ```python
  app.dependency_overrides[get_service] = lambda: service
  ```

## 6. **Standardized Test Fixtures**

- **Autouse Fixtures for Setup/Teardown:**
  - Use autouse fixtures (e.g., `setup_and_teardown`) to automatically manage test state and clear dependency overrides.
  - Consider resetting mocks (via `reset_mock()`) between tests if the mock's state might leak across tests.

- **Custom Result Fixtures:**
  - Create helper fixtures to standardize the return values for CRUD operations.
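
As a small illustration of what `reset_mock()` does and does not clear, assuming a shared mock that is reused across tests:

```python
from unittest.mock import MagicMock

collection = MagicMock()
collection.find_one.return_value = {"_id": "1"}

# Simulate a call made by an earlier test.
collection.find_one({"_id": "1"})
calls_before_reset = collection.find_one.call_count

# reset_mock() clears recorded calls on the mock and its children but,
# by default, keeps configured return values and side effects.
collection.reset_mock()
calls_after_reset = collection.find_one.call_count
return_value_kept = collection.find_one.return_value == {"_id": "1"}
```

Because configured return values survive the reset, a fixture that needs a completely clean mock can either pass `return_value=True` and `side_effect=True` to `reset_mock()` or create a fresh mock per test.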

## 7. **Environment and Test Data Management**

- **Mock Environment Variables:**
  - Use monkeypatch fixtures to ensure your tests run with a controlled environment:
  ```python
  @pytest.fixture(autouse=True)
  def mock_env_vars(monkeypatch):
      monkeypatch.setenv("MONGODB_URL", "mongodb://test:27017")
      monkeypatch.setenv("DATABASE_NAME", "test_db")
  ```

- **Consistent Test Data:**
  - Utilize shared utilities (e.g., in `tests/utils/test_data.py`) to create standardized test data for different collections.

- **Pre-populated Collections:**
  - For integration tests, use fixtures like `mock_collections_with_data` to simulate collections with initial data.

## 8. **Extending and Maintaining Mocks**

- **Adding New Collections/Operations:**
  - Follow existing patterns:
    - Use `AsyncMock` for asynchronous operations.
    - Apply `PropertyMock` for property access.
    - Ensure consistency in the return types and structure of your mocks.

- **Document Side Effects:**
  - Clearly document when and why you use side_effect to simulate errors.

- **Use Additional Tools if Needed:**
  - Consider using plugins like pytest-mock for cleaner syntax and more readable assertions.

## 9. **Additional Good Practices**

- **Test Coverage:**
  - Ensure all CRUD methods and error scenarios are covered by your tests.

- **Realism in Mocks:**
  - Mimic actual MongoDB behavior as closely as possible, including asynchronous behavior, to avoid surprises in production.

- **Refactoring and Maintenance:**
  - Regularly review and refactor your mocking strategy to align with any changes in the production code or database client library.

- **Logging & Debugging:**
  - Consider adding logging or verbose messages in fixtures to help diagnose test failures related to mocking.


## 10. **Additional Test Fixtures and Patterns**

### Collection Relationships and State
When collections have relationships (e.g. foreign keys):
```python
@pytest.fixture
async def mock_collections(mock_database_setup):
    """Setup collections with relationships."""
    primary_collection = AsyncMock()
    related_collection = AsyncMock()
    
    # Setup database relationship
    mock_db = AsyncMock()
    mock_db.get_collection = MagicMock(return_value=related_collection)
    type(primary_collection).database = PropertyMock(return_value=mock_db)
    
    # Assign to database setup
    mock_database_setup.primary = primary_collection
    mock_database_setup.related = related_collection
    
    return primary_collection, related_collection
```

### Test State Management
Always include an autouse fixture to manage test state:
```python
@pytest.fixture(autouse=True)
async def setup_and_teardown():
    """Setup and teardown for each test."""
    # Setup
    yield
    # Teardown - clear dependency overrides
    app.dependency_overrides = {}
```

### Collection Mocking Pattern
Use a fixture to set up collection mocks:
```python
@pytest.fixture
async def mock_collections():
    """Setup mock collections for tests."""
    collection = AsyncMock()
    
    # Mock the database property for nested collections
    mock_db = AsyncMock()
    type(collection).database = PropertyMock(return_value=mock_db)
    
    return collection
```

### Service Setup Pattern
Each test should set up its service with mocked dependencies:
```python
# Setup repository and service
repo = Repository(collection)
service = Service(mock_database_setup, repo)
app.dependency_overrides[get_service] = lambda: service
```

### Collection Operation Mocking
Mock MongoDB operations directly on the collection object:
```python
# Mock find_one operation
collection.find_one = AsyncMock(return_value=mock_doc)

# Mock find operation with cursor
mock_cursor = AsyncMock()
mock_cursor.to_list = AsyncMock(return_value=[mock_doc])
collection.find = MagicMock(return_value=mock_cursor)

# Mock insert operation
collection.insert_one = AsyncMock()

# Mock delete operation with count
mock_result = AsyncMock()
mock_result.deleted_count = 1
collection.delete_one = AsyncMock(return_value=mock_result)

# Mock update operation
mock_result = AsyncMock()
mock_result.modified_count = 1
collection.update_one = AsyncMock(return_value=mock_result)

# Mock error cases
collection.operation = AsyncMock(side_effect=Exception("Database error"))
```
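
To show these mocks in use, here is a minimal end-to-end sketch that drives a repository through the mocked collection. `ItemRepository` and its methods are hypothetical; the project's own repository classes will differ.

```python
import asyncio
from unittest.mock import AsyncMock, MagicMock


class ItemRepository:
    """Hypothetical repository that wraps a Motor-style collection."""

    def __init__(self, collection):
        self.collection = collection

    async def list_items(self) -> list:
        # find() returns a cursor synchronously; to_list() is awaited.
        return await self.collection.find({}).to_list(length=100)

    async def delete_item(self, item_id: str) -> bool:
        result = await self.collection.delete_one({"_id": item_id})
        return result.deleted_count == 1


mock_doc = {"_id": "1", "name": "alpha"}
collection = AsyncMock()

# Mock find operation with cursor
mock_cursor = AsyncMock()
mock_cursor.to_list = AsyncMock(return_value=[mock_doc])
collection.find = MagicMock(return_value=mock_cursor)

# Mock delete operation with count
mock_result = MagicMock()
mock_result.deleted_count = 1
collection.delete_one = AsyncMock(return_value=mock_result)

repo = ItemRepository(collection)
items = asyncio.run(repo.list_items())
deleted = asyncio.run(repo.delete_item("1"))

# Verify the repository called the collection as expected.
collection.find.assert_called_once_with({})
collection.delete_one.assert_awaited_once_with({"_id": "1"})
```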

All content is available under the Open Government Licence v3.0, except where otherwise stated
