If your CLI commands in Windsurf stop executing or hang on macOS, the issue is usually caused by zsh conflicts inside Cascade. The quick fix is to make Windsurf use bash instead of zsh for its internal terminal.
MCP (Model Context Protocol) is a way to give AI assistants access to your application’s data and functionality. Think of it as creating “tools” that an AI can use to help users.
Simple Analogy:
Your app is like a toolbox 🧰
MCP tools are like individual tools (hammer, screwdriver, etc.)
The AI is like a smart assistant that knows which tool to use
🎯 What We’ll Build
A simple “Hello World” MCP server that:
Has a basic API endpoint
Provides data to an AI assistant
Lets the AI answer questions about your app
Time to complete: 15 minutes ⏱️
📋 Prerequisites
You need:
✅ Node.js 18+ installed
✅ Basic knowledge of JavaScript/TypeScript
✅ A text editor (VS Code recommended)
✅ Terminal/Command line access
🏗️ Step-by-Step Tutorial
Step 1: Create a New Next.js Project
# Create a new Next.js app
npx create-next-app@latest my-mcp-app
# When prompted, choose:
# ✅ TypeScript: Yes
# ✅ ESLint: Yes
# ✅ Tailwind CSS: Yes
# ✅ src/ directory: No
# ✅ App Router: Yes (IMPORTANT!)
# ✅ Turbopack: No
# ✅ Import alias: Yes (@/*)

# Navigate to the project
cd my-mcp-app
What just happened?
Created a new Next.js project with App Router
App Router is needed for MCP (uses /app directory)
Step 2: Create the API Directory
# Create the API directory structure
mkdir -p app/api/hello
Step 3: Create Your First API Endpoint
Create app/api/hello/route.ts:
// app/api/hello/route.ts
import { NextResponse } from 'next/server';

export async function GET() {
  // This is your data that the AI can access
  const data = {
    message: "Hello from MCP!",
    timestamp: new Date().toISOString(),
    tips: [
      "MCP lets AI access your app data",
      "You can create multiple endpoints",
      "AI can call these endpoints automatically"
    ]
  };

  return NextResponse.json(data);
}
What does this do?
Creates an API endpoint at /api/hello
Returns JSON data that includes a message and tips
This data will be available to the AI
Step 4: Test Your Endpoint
Start the development server:
npm run dev
Open your browser and visit:
http://localhost:3000/api/hello
You should see:
{
  "message": "Hello from MCP!",
  "timestamp": "2025-10-25T00:00:00.000Z",
  "tips": [
    "MCP lets AI access your app data",
    "You can create multiple endpoints",
    "AI can call these endpoints automatically"
  ]
}
✅ Success! Your first MCP endpoint is working!
Step 5: Install AI SDK
Now let’s add AI capabilities:
# Install Vercel AI SDK and OpenAI
npm install ai @ai-sdk/openai zod
What are these packages?
ai – Vercel AI SDK for building AI apps
@ai-sdk/openai – OpenAI integration
zod – Schema validation (for tool parameters)
Step 6: Create MCP Tools
Create a file to define your MCP tools:
# Create lib directory
mkdir -p lib
Create lib/mcp-tools.ts:
// lib/mcp-tools.ts
import { tool } from 'ai';
import { z } from 'zod';

// Define your MCP tools
export const mcpTools = {
  // Tool 1: Get hello message
  get_hello_message: tool({
    description: 'Get a hello message from the server',
    parameters: z.object({}), // No parameters needed
    execute: async () => {
      // Fetch data from your API
      const response = await fetch('http://localhost:3000/api/hello');
      const data = await response.json();
      return data;
    },
  }),

  // Tool 2: Get current time
  get_current_time: tool({
    description: 'Get the current server time',
    parameters: z.object({}),
    execute: async () => {
      return {
        time: new Date().toLocaleTimeString(),
        date: new Date().toLocaleDateString(),
        timezone: Intl.DateTimeFormat().resolvedOptions().timeZone,
      };
    },
  }),
};
What does this do?
Defines 2 tools that the AI can use
get_hello_message – Fetches data from your API
get_current_time – Returns current time
Each tool has a description (tells AI what it does)
Each tool has parameters (inputs the AI can provide)
Each tool has an execute function (what it actually does)
Step 7: Create AI Chat Endpoint
Create app/api/chat/route.ts:
// app/api/chat/route.ts
import { openai } from '@ai-sdk/openai';
import { streamText } from 'ai';
import { mcpTools } from '@/lib/mcp-tools';

export async function POST(request: Request) {
  // Get the user's message
  const { messages } = await request.json();

  // Call OpenAI with MCP tools
  const result = await streamText({
    model: openai('gpt-4o-mini'),
    messages,
    tools: mcpTools, // Give AI access to your tools
    maxSteps: 5, // Allow AI to use multiple tools
  });

  // Stream the response back to the user
  return result.toDataStreamResponse();
}
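To chat with these tools from the browser, you also need a simple front-end. The sketch below is one minimal way to build it; it assumes the AI SDK's React bindings are available (the @ai-sdk/react package and its useChat hook, which posts to /api/chat by default — install it with npm install @ai-sdk/react if needed). The exact import path and message shape vary between AI SDK versions.
Create app/page.tsx (replacing the default home page):
// app/page.tsx (sketch — in older AI SDK versions the hook lives in 'ai/react')
'use client';

import { useChat } from '@ai-sdk/react';

export default function Chat() {
  // useChat keeps the message history and streams replies from /api/chat
  const { messages, input, handleInputChange, handleSubmit } = useChat();

  return (
    <main className="mx-auto max-w-xl p-4">
      {messages.map((m) => (
        <p key={m.id}>
          <strong>{m.role === 'user' ? 'You: ' : 'AI: '}</strong>
          {m.content}
        </p>
      ))}
      <form onSubmit={handleSubmit}>
        <input
          className="w-full border p-2"
          value={input}
          onChange={handleInputChange}
          placeholder="Ask something..."
        />
      </form>
    </main>
  );
}
Make sure OPENAI_API_KEY is set (for example in .env.local), since the @ai-sdk/openai provider reads it from the environment, then run npm run dev and open http://localhost:3000.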
Step 8: Test Your AI Assistant
Ask a few questions and watch the AI pick the right tool.
Question 1:
What's the hello message?
AI Response:
The hello message is "Hello from MCP!"
Here are some tips:
- MCP lets AI access your app data
- You can create multiple endpoints
- AI can call these endpoints automatically
🔧 Used tools: get_hello_message
Question 2:
What time is it?
AI Response:
The current time is 12:34:56 PM
Date: October 25, 2025
Timezone: America/New_York
🔧 Used tools: get_current_time
🎉 Congratulations! Your MCP server is working!
🎨 Visual Flow
The user asks "What's the hello message?" → the chat UI sends POST /api/chat → the chat route passes the message plus the tool definitions to OpenAI → OpenAI decides to use get_hello_message → the MCP tool calls GET /api/hello and gets back {message, tips} → OpenAI turns that data into a natural-language answer → the answer streams back to the chat UI as Server-Sent Events → the user sees the formatted response.
The same pattern works for any endpoint. For example, imagine a /api/users endpoint wrapped in a get_users tool:
User: "Who are the users and how old are they?"
AI: (uses get_users tool automatically)
→ Calls /api/users
→ Gets data
→ Processes it
→ Responds naturally
AI: "There are 2 users:
- Alice is 30 years old
- Bob is 25 years old"
Key Differences Table
| Aspect | API Endpoint | MCP Tool |
| --- | --- | --- |
| What it is | HTTP route | Function wrapper |
| Who calls it | Anyone (HTTP) | Only AI |
| How to call | fetch(), curl | AI decides automatically |
| Response | Raw JSON | Processed by AI |
| Purpose | Serve data | Give AI capabilities |
| Reusable | Yes, by anyone | Only by AI |
Why Do You Need BOTH?
API Endpoint:
✅ The actual data source
✅ Can be used by other parts of your app
✅ Can be tested independently
✅ Follows REST conventions
✅ Can be cached and secured
MCP Tool:
✅ Tells AI what the endpoint does
✅ Provides context (description)
✅ Defines parameters AI can use
✅ Allows AI to use it intelligently
✅ Makes your app AI-powered
Restaurant Analogy
API Endpoint = Kitchen
Has the actual food (data)
Anyone can order from it
Returns raw dishes
MCP Tool = Waiter
Knows what’s available (description)
Takes your order (parameters)
Gets food from kitchen (calls API)
Serves it nicely (formats response)
AI = Smart Waiter
Understands what you want
Knows which dishes to recommend
Can combine multiple dishes
Explains the menu in your language
The Complete Flow
The user asks "What's the hello message?" → OpenAI reads the tool descriptions and decides to use get_hello_message → the tool's execute() runs and sends GET /api/hello → the endpoint fetches your data and returns it → the JSON response flows back through the tool as structured data → OpenAI processes and formats it → the user reads "The hello message is ...".
Can You Skip the API Endpoint?
Short answer: Yes, but not recommended!
// ❌ MCP Tool without API (works but not ideal)
get_data: tool({
  execute: async () => {
    // Directly query database
    const data = await db.query('SELECT * FROM users');
    return data;
  },
})

// ✅ Better: MCP Tool + API (recommended)
get_data: tool({
  execute: async () => {
    // Call your API
    const res = await fetch('http://localhost:3000/api/users');
    return await res.json();
  },
})
Why separate API is better:
✅ Reusable by other parts of your app
✅ Can be tested independently
✅ Can be called by non-AI clients
✅ Easier to maintain and debug
✅ Can add authentication/caching
When to Use What
Use Just API Endpoint When:
Building a regular web app
Need data for your frontend
Other services need to call it
No AI involved
Use API + MCP Tool When:
Want AI to access the data
Building AI chat features
Need intelligent data retrieval
Want natural language interface
Summary: The Relationship
API Endpoint (data source)
↓
MCP Tool (AI access layer)
↓
OpenAI (intelligence)
↓
User (natural language)
Think of it this way:
API Endpoint = What you have (data)
MCP Tool = How AI accesses it (wrapper)
Together = AI-powered application 🚀
📚 Key Takeaways
MCP = Tools for AI
You create tools (functions)
AI decides when to use them
AI can use multiple tools to answer questions
Three Main Parts:
API endpoints (your data)
MCP tools (how AI accesses data)
Chat endpoint (connects AI to tools)
Simple Pattern:
User asks question → AI reads question → AI calls your MCP tools → Tools fetch data → AI generates answer → User sees response
You’ve built your first MCP server! You now understand:
✅ What MCP is and why it's useful
✅ How to create API endpoints
✅ How to define MCP tools
✅ How to connect AI to your tools
✅ How to build a chat interface
Next Challenge: Try building a tool that:
Reads data from a database
Calls an external API
Performs calculations
Analyzes files
💡 Pro Tips
Start Simple
Begin with one tool
Add more as you learn
Test each tool individually
Good Tool Descriptions
// ❌ Bad
description: 'Gets data'
// ✅ Good
description: 'Get user profile information including name, email, and registration date'
Use Parameters
// Let AI provide inputs
parameters: z.object({
  userId: z.number().describe('The ID of the user to fetch'),
  includeOrders: z.boolean().describe('Whether to include order history'),
})
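Putting the last two tips together, here is a sketch of what a fuller, parameterised tool could look like. The get_user tool name, the /api/users/[id] route, and the orders query parameter are hypothetical examples, not part of the project built above:
// lib/mcp-tools.ts (illustrative sketch — get_user and /api/users/[id] are hypothetical)
import { tool } from 'ai';
import { z } from 'zod';

export const get_user = tool({
  description:
    'Get user profile information including name, email, and registration date',
  parameters: z.object({
    userId: z.number().describe('The ID of the user to fetch'),
    includeOrders: z.boolean().describe('Whether to include order history'),
  }),
  execute: async ({ userId, includeOrders }) => {
    // The AI fills in userId and includeOrders; the tool just forwards them to your API
    const res = await fetch(
      `http://localhost:3000/api/users/${userId}?orders=${includeOrders}`
    );
    return await res.json();
  },
});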
NAME CPU(cores) MEMORY(bytes)
app-xxxxxxxxxx-xxxxx 1500m 587Mi ← CPU at limit!
app-xxxxxxxxxx-xxxxx 1m 524Mi ← Crashed pod
app-xxxxxxxxxx-xxxxx 1m 526Mi ← Crashed pod
Problem: PHP pods crashing due to worker exhaustion and short health check timeouts.
Solution: Increased PHP-FPM workers from 5 to 20 and health check timeout from 10s to 30s.
Result: All 22 pods stable, no crashes, 4x capacity increase.
Key Learning: Always calculate worker count based on available memory, not arbitrary numbers. The formula (RAM - Base) / 40MB ensures you stay within limits while maximizing capacity.
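As a quick worked example of that formula (the ~225Mi of base overhead here is an assumed figure, purely for illustration): with a 1Gi (1024Mi) pod limit, (1024Mi − 225Mi) / 40MB ≈ 20 workers, which matches the value this cluster ended up running.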
Author’s Note: This solution was implemented on a production Kubernetes cluster running a PHP application. The fix eliminated all CrashLoopBackOff issues and improved stability from 45% to 100% pod availability.
📊 Current Production Metrics Analysis
Actual Memory Usage (Right Now):
Average Memory per Pod: 665Mi
Memory Range: 544Mi – 616Mi
Memory Limit: 1024Mi (1Gi)
Utilization: 65% average (53–60% range)
Current Configuration:
Pods: 22 total
Workers per pod: 20
CPU Limit: 1500m
Average CPU Usage: 935m (62% utilization)
Restarts: 0-4 (minimal, stable)
🔍 Is 20 Workers Optimal? Let’s Calculate:
Method 1: Reverse Engineering from Memory Usage
Current Memory Usage: 665Mi average
Memory Limit: 1024Mi
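Working backwards from those numbers (assuming roughly 265Mi of that usage is fixed overhead from the PHP-FPM master process, opcache, and the base image rather than the workers themselves): (665Mi − 265Mi) / 20 workers ≈ 20MB per worker, which lines up with the 18–20MB per-worker figure noted below.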
⚠️ 30 Workers (Too Aggressive):
- Memory: 83% (risky, little headroom)
- CPU: Would need more (1500m not enough)
- Risk: Traffic spikes could OOMKill
- Not recommended
❌ 15 Workers (Too Conservative):
- Memory: 54% (underutilized)
- Would crash under current 263+ concurrent requests
- Wasted resources
🎯 Final Recommendation: KEEP 20 WORKERS
Reasons:
Memory Utilization: 65% – Perfect balance (not too high, not too low)
CPU Utilization: 62% – Efficient, room to grow
35% Headroom – Enough buffer for traffic spikes
Stable: 0-4 restarts – Proves it’s working well
Actual per-worker memory: 18-20MB – More efficient than estimated
Could We Optimize Further?
Option A: Increase to 25 workers (Moderate)
pm.max_children = 25
Memory would be: ~750Mi (73%)
Headroom: ~274Mi (27%)
Risk: Medium (less buffer for spikes)
Benefit: 25% more capacity
Verdict: Only if you see consistent high load
Option B: Keep 20 workers (Recommended)
pm.max_children = 20  # Current
Memory: 665Mi (65%) ✅
Headroom: 358Mi (35%) ✅
Risk: Low ✅
Stability: Proven ✅
Verdict: OPTIMAL – Don’t change!
📈 When to Reconsider:
Monitor these metrics and adjust only if workers start maxing out:
# Check if workers are maxed out
kubectl exec <pod> -c php-php -- curl -s http://localhost:9000/fpm-status
from google import genai
from google.genai import types
from google.genai.types import Content, Part
from playwright.sync_api import sync_playwright
import time

# Initialize the Gemini client
client = genai.Client()

# Screen dimensions
SCREEN_WIDTH = 1440
SCREEN_HEIGHT = 900

def denormalize_x(x: int, screen_width: int) -> int:
    """Convert normalized x coordinate (0-1000) to actual pixel coordinate."""
    return int(x / 1000 * screen_width)

def denormalize_y(y: int, screen_height: int) -> int:
    """Convert normalized y coordinate (0-1000) to actual pixel coordinate."""
    return int(y / 1000 * screen_height)

def execute_function_calls(candidate, page, screen_width, screen_height):
    """Execute the actions suggested by the model."""
    results = []
    function_calls = []
    for part in candidate.content.parts:
        if part.function_call:
            function_calls.append(part.function_call)

    for function_call in function_calls:
        action_result = {}
        fname = function_call.name
        args = function_call.args
        print(f" -> Executing: {fname}")
        try:
            if fname == "open_web_browser":
                pass  # Already open
            elif fname == "click_at":
                actual_x = denormalize_x(args["x"], screen_width)
                actual_y = denormalize_y(args["y"], screen_height)
                page.mouse.click(actual_x, actual_y)
            elif fname == "type_text_at":
                actual_x = denormalize_x(args["x"], screen_width)
                actual_y = denormalize_y(args["y"], screen_height)
                text = args["text"]
                press_enter = args.get("press_enter", False)
                page.mouse.click(actual_x, actual_y)
                page.keyboard.press("Meta+A")
                page.keyboard.press("Backspace")
                page.keyboard.type(text)
                if press_enter:
                    page.keyboard.press("Enter")
            page.wait_for_load_state(timeout=5000)
            time.sleep(1)
        except Exception as e:
            print(f"Error executing {fname}: {e}")
            action_result = {"error": str(e)}
        results.append((fname, action_result))
    return results

def get_function_responses(page, results):
    """Capture screenshot and URL after actions."""
    screenshot_bytes = page.screenshot(type="png")
    current_url = page.url
    function_responses = []
    for name, result in results:
        response_data = {"url": current_url}
        response_data.update(result)
        function_responses.append(
            types.FunctionResponse(
                name=name,
                response=response_data,
                parts=[types.FunctionResponsePart(
                    inline_data=types.FunctionResponseBlob(
                        mime_type="image/png",
                        data=screenshot_bytes))
                ]
            )
        )
    return function_responses

# Main program
print("Initialising browser...")
playwright = sync_playwright().start()
browser = playwright.chromium.launch(headless=False)
context = browser.new_context(viewport={"width": SCREEN_WIDTH, "height": SCREEN_HEIGHT})
page = context.new_page()

try:
    # Go to initial page
    page.goto("https://tinyurl.com/pet-care-signup")

    # Configure the model with Computer Use tool
    config = types.GenerateContentConfig(
        tools=[types.Tool(computer_use=types.ComputerUse(
            environment=types.Environment.ENVIRONMENT_BROWSER
        ))],
    )

    # Take initial screenshot
    initial_screenshot = page.screenshot(type="png")

    USER_PROMPT = """
    From https://tinyurl.com/pet-care-signup,
    get all details for any pet with a California residency.
    Output all the information you find in a clear, readable format.
    """
    print(f"Goal: {USER_PROMPT}")

    contents = [
        Content(role="user", parts=[
            Part(text=USER_PROMPT),
            Part.from_bytes(data=initial_screenshot, mime_type='image/png')
        ])
    ]

    # Agent loop - maximum 5 turns
    for i in range(5):
        print(f"\n--- Turn {i+1} ---")
        print("Thinking...")
        response = client.models.generate_content(
            model='gemini-2.5-computer-use-preview-10-2025',
            contents=contents,
            config=config,
        )
        candidate = response.candidates[0]
        contents.append(candidate.content)

        # Check if there are function calls to execute
        has_function_calls = any(part.function_call for part in candidate.content.parts)
        if not has_function_calls:
            text_response = " ".join([part.text for part in candidate.content.parts if part.text])
            print("Agent finished:", text_response)
            break

        print("Executing actions...")
        results = execute_function_calls(candidate, page, SCREEN_WIDTH, SCREEN_HEIGHT)
        print("Capturing state...")
        function_responses = get_function_responses(page, results)
        contents.append(
            Content(role="user", parts=[Part(function_response=fr) for fr in function_responses])
        )
finally:
    print("\nClosing browser...")
    browser.close()
    playwright.stop()
    print("Done!")
from google import genai
from google.genai import types
from google.genai.types import Content, Part
from playwright.sync_api import sync_playwright
import time

# Initialize the Gemini client
client = genai.Client()

# Screen dimensions
SCREEN_WIDTH = 1440
SCREEN_HEIGHT = 900

def denormalize_x(x: int, screen_width: int) -> int:
    """Convert normalized x coordinate (0-1000) to actual pixel coordinate."""
    return int(x / 1000 * screen_width)

def denormalize_y(y: int, screen_height: int) -> int:
    """Convert normalized y coordinate (0-1000) to actual pixel coordinate."""
    return int(y / 1000 * screen_height)

def execute_function_calls(candidate, page, screen_width, screen_height):
    """Execute the actions suggested by the model."""
    results = []
    function_calls = []
    for part in candidate.content.parts:
        if part.function_call:
            function_calls.append(part.function_call)

    for function_call in function_calls:
        action_result = {}
        fname = function_call.name
        args = function_call.args
        print(f" -> Executing: {fname}")
        try:
            if fname == "open_web_browser":
                pass  # Already open
            elif fname == "click_at":
                actual_x = denormalize_x(args["x"], screen_width)
                actual_y = denormalize_y(args["y"], screen_height)
                page.mouse.click(actual_x, actual_y)
            elif fname == "type_text_at":
                actual_x = denormalize_x(args["x"], screen_width)
                actual_y = denormalize_y(args["y"], screen_height)
                text = args["text"]
                press_enter = args.get("press_enter", False)
                page.mouse.click(actual_x, actual_y)
                page.keyboard.press("Meta+A")
                page.keyboard.press("Backspace")
                page.keyboard.type(text)
                if press_enter:
                    page.keyboard.press("Enter")
            elif fname == "drag_and_drop":
                start_x = denormalize_x(args["x"], screen_width)
                start_y = denormalize_y(args["y"], screen_height)
                dest_x = denormalize_x(args["destination_x"], screen_width)
                dest_y = denormalize_y(args["destination_y"], screen_height)
                # Perform drag and drop
                page.mouse.move(start_x, start_y)
                page.mouse.down()
                page.mouse.move(dest_x, dest_y)
                page.mouse.up()
            page.wait_for_load_state(timeout=5000)
            time.sleep(1)
        except Exception as e:
            print(f"Error executing {fname}: {e}")
            action_result = {"error": str(e)}
        results.append((fname, action_result))
    return results

def get_function_responses(page, results):
    """Capture screenshot and URL after actions."""
    screenshot_bytes = page.screenshot(type="png")
    current_url = page.url
    function_responses = []
    for name, result in results:
        response_data = {"url": current_url}
        response_data.update(result)
        function_responses.append(
            types.FunctionResponse(
                name=name,
                response=response_data,
                parts=[types.FunctionResponsePart(
                    inline_data=types.FunctionResponseBlob(
                        mime_type="image/png",
                        data=screenshot_bytes))
                ]
            )
        )
    return function_responses

# Main program
print("Initialising browser...")
playwright = sync_playwright().start()
browser = playwright.chromium.launch(headless=False)
context = browser.new_context(viewport={"width": SCREEN_WIDTH, "height": SCREEN_HEIGHT})
page = context.new_page()

try:
    # Go to initial page
    page.goto("https://sticky-note-jam.web.app")

    # Configure the model with Computer Use tool
    config = types.GenerateContentConfig(
        tools=[types.Tool(computer_use=types.ComputerUse(
            environment=types.Environment.ENVIRONMENT_BROWSER
        ))],
    )

    # Take initial screenshot
    initial_screenshot = page.screenshot(type="png")

    USER_PROMPT = """
    My art club brainstormed tasks ahead of our fair.
    The board is chaotic and I need your help organising the tasks into some categories I created.
    Go to sticky-note-jam.web.app and
    ensure notes are clearly in the right sections.
    Drag them there if not.
    In your output, describe what the initial stage looked like and
    what the final stage looks like after organisation.
    """
    print(f"Goal: {USER_PROMPT}")

    contents = [
        Content(role="user", parts=[
            Part(text=USER_PROMPT),
            Part.from_bytes(data=initial_screenshot, mime_type='image/png')
        ])
    ]

    # Agent loop - maximum 10 turns (more turns for drag operations)
    for i in range(10):
        print(f"\n--- Turn {i+1} ---")
        print("Thinking...")
        response = client.models.generate_content(
            model='gemini-2.5-computer-use-preview-10-2025',
            contents=contents,
            config=config,
        )
        candidate = response.candidates[0]
        contents.append(candidate.content)

        # Check if there are function calls to execute
        has_function_calls = any(part.function_call for part in candidate.content.parts)
        if not has_function_calls:
            text_response = " ".join([part.text for part in candidate.content.parts if part.text])
            print("Agent finished:", text_response)
            break

        print("Executing actions...")
        results = execute_function_calls(candidate, page, SCREEN_WIDTH, SCREEN_HEIGHT)
        print("Capturing state...")
        function_responses = get_function_responses(page, results)
        contents.append(
            Content(role="user", parts=[Part(function_response=fr) for fr in function_responses])
        )
finally:
    print("\nClosing browser...")
    browser.close()
    playwright.stop()
    print("Done!")
# Install Claude Code
npm install -g @anthropic-ai/claude-code
# Install Claude Agents SDK
pip install claude-agent-sdk
# Set API Key
export ANTHROPIC_API_KEY=your_api_key_here
Basic
import asyncio
from claude_agent_sdk import query

async def main():
    async for message in query(prompt="Hello, how are you?"):
        print(message)

asyncio.run(main())
Built-in Tools
import asyncio
from claude_agent_sdk import query, ClaudeAgentOptions
from rich import print

async def main():
    options = ClaudeAgentOptions(
        allowed_tools=["Read", "Write"],
        permission_mode="acceptEdits"
    )
    async for msg in query(
        prompt="Create a file called greeting.txt with 'Hello Mervin Praison!'",
        options=options
    ):
        print(msg)

asyncio.run(main())
Custom Tools
import asyncio
from typing import Any
from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions, tool, create_sdk_mcp_server
from rich import print

@tool("greet", "Greet a user", {"name": str})
async def greet(args: dict[str, Any]) -> dict[str, Any]:
    return {
        "content": [{
            "type": "text",
            "text": f"Hello, {args['name']}!"
        }]
    }

server = create_sdk_mcp_server(
    name="my-tools",
    version="1.0.0",
    tools=[greet]
)

async def main():
    options = ClaudeAgentOptions(
        mcp_servers={"tools": server},
        allowed_tools=["mcp__tools__greet"]
    )
    async with ClaudeSDKClient(options=options) as client:
        await client.query("Greet Mervin Praison")
        async for msg in client.receive_response():
            print(msg)

asyncio.run(main())
Claude Agent Options
import asyncio
from claude_agent_sdk import query, ClaudeAgentOptions
from rich import print

async def main():
    options = ClaudeAgentOptions(
        system_prompt="You are an expert Python developer",
        permission_mode='acceptEdits',
        cwd="/Users/praison/cc"
    )
    async for message in query(
        prompt="Create a Python web server in my current directory",
        options=options
    ):
        print(message)

asyncio.run(main())
from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client()

prompt = "Add a Cap to the person's head"
image = Image.open('mervinpraison.jpeg')

response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=[prompt, image],
)

for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = Image.open(BytesIO(part.inline_data.data))
        image.save("generated_image.png")
        print("Generated image saved as 'generated_image.png'")
UI.py
import gradio as gr
from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client()

def edit_image(image, prompt):
    response = client.models.generate_content(
        model="gemini-2.5-flash-image-preview",
        contents=[prompt, image],
    )
    if response.candidates[0].finish_reason.name == 'PROHIBITED_CONTENT':
        return None, "Content blocked by safety filters"
    elif response.candidates[0].content is None:
        return None, f"No content generated: {response.candidates[0].finish_reason.name}"
    for part in response.candidates[0].content.parts:
        if part.inline_data is not None:
            return Image.open(BytesIO(part.inline_data.data)), "Image generated successfully"
    return None, "No image found in response"

iface = gr.Interface(
    fn=edit_image,
    inputs=[
        gr.Image(type="pil", label="Upload Image"),
        gr.Textbox(label="Edit Prompt", value="Add a Cap to the person's head")
    ],
    outputs=[
        gr.Image(label="Edited Image"),
        gr.Textbox(label="Status")
    ],
    title="Image Editor"
)

iface.launch()
from poml import poml
import requests, json
from rich import print

# 1) Load and render POML file
messages = poml("financial_analysis.poml", chat=True)

# 2) Combine messages into a single prompt
full_prompt = "\n".join(
    ["\n".join(str(c).strip() for c in m["content"]) if isinstance(m.get("content"), list) else str(m["content"]).strip()
     for m in messages if m.get("content")]
)
print("\n--- Full Prompt ---\n")
print(full_prompt)

# 3) Call Ollama Model
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "qwen2.5vl:latest", "prompt": full_prompt, "stream": False},
)
data = resp.json()
print("\n--- Model Response ---\n")
print(data.get("response") or json.dumps(data, indent=2))
import langextract as lx
import textwrap

# 1. Define the prompt and extraction rules
prompt = textwrap.dedent("""\
    Extract characters, emotions, and relationships in order of appearance.
    Use exact text for extractions. Do not paraphrase or overlap entities.
    Provide meaningful attributes for each entity to add context.""")

# 2. Provide a high-quality example to guide the model
examples = [
    lx.data.ExampleData(
        text="ROMEO. But soft! What light through yonder window breaks? It is the east, and Juliet is the sun.",
        extractions=[
            lx.data.Extraction(
                extraction_class="character",
                extraction_text="ROMEO",
                attributes={"emotional_state": "wonder"}
            ),
            lx.data.Extraction(
                extraction_class="emotion",
                extraction_text="But soft!",
                attributes={"feeling": "gentle awe"}
            ),
            lx.data.Extraction(
                extraction_class="relationship",
                extraction_text="Juliet is the sun",
                attributes={"type": "metaphor"}
            ),
        ]
    )
]

# The input text to be processed
input_text = "Lady Juliet gazed longingly at the stars, her heart aching for Romeo"

# Run the extraction
result = lx.extract(
    text_or_documents=input_text,
    prompt_description=prompt,
    examples=examples,
    model_id="gemini-2.5-flash",
)

# Save the results to a JSONL file
lx.io.save_annotated_documents([result], output_name="extraction_results.jsonl")

# Generate the visualization from the file
html_content = lx.visualize("test_output/extraction_results.jsonl")
with open("test_output/visualization.html", "w") as f:
    f.write(html_content)
Advanced
import langextract as lx
import textwrap
from collections import Counter, defaultdict

# Define comprehensive prompt and examples for complex literary text
prompt = textwrap.dedent("""\
    Extract characters, emotions, and relationships from the given text.
    Provide meaningful attributes for every entity to add context and depth.
    Important: Use exact text from the input for extraction_text. Do not paraphrase.
    Extract entities in order of appearance with no overlapping text spans.
    Note: In play scripts, speaker names appear in ALL-CAPS followed by a period.""")

examples = [
    lx.data.ExampleData(
        text=textwrap.dedent("""\
            ROMEO. But soft! What light through yonder window breaks?
            It is the east, and Juliet is the sun.
            JULIET. O Romeo, Romeo! Wherefore art thou Romeo?"""),
        extractions=[
            lx.data.Extraction(
                extraction_class="character",
                extraction_text="ROMEO",
                attributes={"emotional_state": "wonder"}
            ),
            lx.data.Extraction(
                extraction_class="emotion",
                extraction_text="But soft!",
                attributes={"feeling": "gentle awe", "character": "Romeo"}
            ),
            lx.data.Extraction(
                extraction_class="relationship",
                extraction_text="Juliet is the sun",
                attributes={"type": "metaphor", "character_1": "Romeo", "character_2": "Juliet"}
            ),
            lx.data.Extraction(
                extraction_class="character",
                extraction_text="JULIET",
                attributes={"emotional_state": "yearning"}
            ),
            lx.data.Extraction(
                extraction_class="emotion",
                extraction_text="Wherefore art thou Romeo?",
                attributes={"feeling": "longing question", "character": "Juliet"}
            ),
        ]
    )
]

# Process Romeo & Juliet directly from Project Gutenberg
print("Downloading and processing Romeo and Juliet from Project Gutenberg...")
result = lx.extract(
    text_or_documents="https://www.gutenberg.org/files/1513/1513-0.txt",
    prompt_description=prompt,
    examples=examples,
    model_id="gemini-2.5-flash",
    extraction_passes=3,    # Multiple passes for improved recall
    max_workers=20,         # Parallel processing for speed
    max_char_buffer=1000    # Smaller contexts for better accuracy
)
print(f"Extracted {len(result.extractions)} entities from {len(result.text):,} characters")

# Save and visualize the results
lx.io.save_annotated_documents([result], output_name="romeo_juliet_extractions.jsonl")

# Generate the interactive visualization
html_content = lx.visualize("test_output/romeo_juliet_extractions.jsonl")
with open("test_output/romeo_juliet_visualization.html", "w") as f:
    f.write(html_content)
print("Interactive visualization saved to romeo_juliet_visualization.html")

# Analyze character mentions
characters = {}
for e in result.extractions:
    if e.extraction_class == "character":
        char_name = e.extraction_text
        if char_name not in characters:
            characters[char_name] = {"count": 0, "attributes": set()}
        characters[char_name]["count"] += 1
        if e.attributes:
            for attr_key, attr_val in e.attributes.items():
                characters[char_name]["attributes"].add(f"{attr_key}: {attr_val}")

# Print character summary
print(f"\nCHARACTER SUMMARY ({len(characters)} unique characters)")
print("=" * 60)
sorted_chars = sorted(characters.items(), key=lambda x: x[1]["count"], reverse=True)
for char_name, char_data in sorted_chars[:10]:  # Top 10 characters
    attrs_preview = list(char_data["attributes"])[:3]
    attrs_str = f" ({', '.join(attrs_preview)})" if attrs_preview else ""
    print(f"{char_name}: {char_data['count']} mentions{attrs_str}")

# Entity type breakdown
entity_counts = Counter(e.extraction_class for e in result.extractions)
print(f"\nENTITY TYPE BREAKDOWN")
print("=" * 60)
for entity_type, count in entity_counts.most_common():
    percentage = (count / len(result.extractions)) * 100
    print(f"{entity_type}: {count} ({percentage:.1f}%)")