Advanced Deep Learning with Keras in Python

The Keras Functional API

Input layers

In [ ]:
# Import Input from keras.layers
from keras.layers import Input

# Create an input layer of shape 1
input_tensor = Input(shape=(1,))

Dense layers

In [ ]:
# Load layers
from keras.layers import Input, Dense

# Input layer
input_tensor = Input(shape=(1,))

# Dense layer
output_layer = Dense(1)

# Connect the dense layer to the input_tensor
output_tensor = output_layer(input_tensor)

This network will take the input, multiply it by a single weight (plus a bias, by default), and return the result.
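To see that coefficient concretely, you can inspect the layer's weights once it has been connected; a minimal sketch, assuming the output_layer and input_tensor defined above:

In [ ]:
# Calling the layer on input_tensor builds it, so its (randomly
# initialized) weight and bias already exist
weight, bias = output_layer.get_weights()
print(weight.shape, bias.shape)  # (1, 1) and (1,): output = weight * input + bias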

Output layers

In [ ]:
# Load layers
from keras.layers import Input, Dense

# Input layer
input_tensor = Input(shape=(1,))

# Create a dense layer and connect the dense layer to the input_tensor in one step
# Note that we did this in 2 steps in the previous exercise, but are doing it in one step now
output_tensor = Dense(1)(input_tensor)

The output layer allows your model to make predictions.

Build a model

In [ ]:
# Input/dense/output layers
from keras.layers import Input, Dense
input_tensor = Input(shape=(1,))
output_tensor = Dense(1)(input_tensor)

# Build the model
from keras.models import Model
model = Model(input_tensor, output_tensor)

Compile a model

Compiling finalizes the model's settings, such as its optimizer and loss function, and prepares it to meet some data!

In [ ]:
# Compile the model
model.compile(optimizer='adam', loss='mean_absolute_error')

Visualize a model

In [ ]:
# Import the plotting function
from keras.utils import plot_model
import matplotlib.pyplot as plt

# Summarize the model
model.summary()

# Plot the model
plot_model(model, to_file='model.png')

# Display the image
data = plt.imread('model.png')
plt.imshow(data)
plt.show()

Fit the model to the tournament basketball data

In [ ]:
import pandas as pd

games_tourney_train = pd.read_csv('../input/games-tourney/games_tourney.csv')
games_tourney_train.head()
In [ ]:
games_tourney_train.shape
In [ ]:
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(games_tourney_train['seed_diff'], games_tourney_train['score_diff'])
In [ ]:
# Now fit the model
model.fit(X_train, y_train,
          epochs=1,
          batch_size=128,
          validation_split=.10,
          verbose=True)

Evaluate the model on a test set

In [ ]:
# Evaluate the model on the test data
model.evaluate(X_test, y_test)

Two Input Networks Using Categorical Embeddings, Shared Layers, and Merge Layers

Define team lookup

In [ ]:
games_season = pd.read_csv('../input/games-season/games_season.csv')
games_season.head()
In [ ]:
# Imports
from keras.layers import Embedding
from numpy import unique

# Count the unique number of teams
n_teams = unique(games_season['team_1']).shape[0]

# Create an embedding layer
team_lookup = Embedding(input_dim=n_teams,
                        output_dim=1,
                        input_length=1,
                        name='Team-Strength')

The embedding layer is a lot like a dictionary, but your model learns the values for each key.
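To make the dictionary analogy concrete, here is a minimal sketch (with made-up numbers) contrasting a fixed dictionary with the embedding's learnable weight matrix:

In [ ]:
import numpy as np

# A plain dictionary has hand-picked values
strength_dict = {0: 0.5, 1: -1.2, 2: 0.8}

# An embedding stores an (n_teams, output_dim) weight matrix;
# looking up team i selects row i, and the rows are adjusted
# by gradient descent during training
embedding_matrix = np.random.randn(3, 1)
team_id = 1
print(strength_dict[team_id])     # fixed value
print(embedding_matrix[team_id])  # learnable value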

Define team model

In [ ]:
# Imports
from keras.layers import Input, Embedding, Flatten
from keras.models import Model

# Create an input layer for the team ID
teamid_in = Input(shape=(1,))

# Lookup the input in the team strength embedding layer
strength_lookup = team_lookup(teamid_in)

# Flatten the output
strength_lookup_flat = Flatten()(strength_lookup)

# Combine the operations into a single, re-usable model
team_strength_model = Model(teamid_in, strength_lookup_flat, name='Team-Strength-Model')

Defining two inputs

In [ ]:
# Load the input layer from keras.layers
from keras.layers import Input

# Input layer for team 1
team_in_1 = Input((1,), name='Team-1-In')

# Separate input layer for team 2
team_in_2 = Input((1,), name='Team-2-In')

These two inputs will be used later for the shared layer.

Lookup both inputs in the same model

In [ ]:
# Lookup team 1 in the team strength model
team_1_strength = team_strength_model(team_in_1)

# Lookup team 2 in the team strength model
team_2_strength = team_strength_model(team_in_2)

Now your model knows how strong each team is.

Output layer using shared layer

In [ ]:
# Import the Subtract layer from keras
from keras.layers import Subtract

# Create a subtract layer using the inputs from the previous exercise
score_diff = Subtract()([team_1_strength, team_2_strength])

This setup subtracts the team strength ratings to determine a winner.

Model using two inputs and one output

In [ ]:
# Imports
from keras.layers import Subtract
from keras.models import Model

# Subtraction layer from previous exercise
score_diff = Subtract()([team_1_strength, team_2_strength])

# Create the model
model = Model([team_in_1, team_in_2], score_diff)

# Compile the model
model.compile(optimizer='adam', loss='mean_absolute_error')
In [ ]:
model.summary()

Fit the model to the regular season training data

In [ ]:
# Get the team_1 column from the regular season data
input_1 = games_season['team_1']

# Get the team_2 column from the regular season data
input_2 = games_season['team_2']

# Fit the model to input 1 and 2, using score diff as a target
model.fit([input_1, input_2],
          games_season['score_diff'],
          epochs=1,
          batch_size=2048,
          validation_split=.1,
          verbose=True)

Now our model has learned a strength rating for every team.
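If you want to see those ratings, the learned embedding weights can be pulled out of the shared model; a minimal sketch, assuming the layer name used above:

In [ ]:
# Each row of this (n_teams, 1) matrix is one team's learned strength
strengths = team_strength_model.get_layer('Team-Strength').get_weights()[0]
print(strengths[:5])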

Evaluate the model on the tournament test data

In [ ]:
games_tourney = pd.read_csv('../input/games-tourney/games_tourney.csv')
games_tourney.head()
In [ ]:
# Get team_1 from the tournament data
input_1 = games_tourney['team_1']

# Get team_2 from the tournament data
input_2 = games_tourney['team_2']

# Evaluate the model using these inputs
model.evaluate([input_1, input_2], games_tourney['score_diff'])

Multiple Inputs: 3 Inputs (and Beyond!)

Make an input layer for home vs. away

You know there is a well-documented home-team advantage in basketball, so you will add a new input to your model to capture this effect.

In [ ]:
from keras.layers import Concatenate
In [ ]:
# Create an Input for each team
team_in_1 = Input(shape=(1,), name='Team-1-In')
team_in_2 = Input(shape=(1,), name='Team-2-In')

# Create an input for home vs away
home_in = Input(shape=(1,), name='Home-In')

# Lookup the team inputs in the team strength model
team_1_strength = team_strength_model(team_in_1)
team_2_strength = team_strength_model(team_in_2)

# Combine the team strengths with the home input using a Concatenate layer, then add a Dense layer
out = Concatenate()([team_1_strength, team_2_strength, home_in])
out = Dense(1)(out)

Now you have a model with 3 inputs!

Make a model and compile it

In [ ]:
# Import the model class
from keras.models import Model

# Make a Model
model = Model([team_in_1, team_in_2, home_in], out)

# Compile the model
model.compile(optimizer='adam', loss='mean_absolute_error')
In [ ]:
model.summary()

Fit the model and evaluate

In [ ]:
# Fit the model to the games_season dataset
model.fit([games_season['team_1'], games_season['team_2'], games_season['home']],
          games_season['score_diff'],
          epochs=1,
          verbose=True,
          validation_split=.1,
          batch_size=2048)

# Evaluate the model on the games_tourney dataset
model.evaluate([games_tourney['team_1'], games_tourney['team_2'], games_tourney['home']], games_tourney['score_diff'])

Plotting models

In [ ]:
# Imports
import matplotlib.pyplot as plt
from keras.utils import plot_model

# Plot the model
plot_model(model, to_file='model.png')

# Display the image
data = plt.imread('model.png')
plt.imshow(data)
plt.show()

Add the model predictions to the tournament data

In [ ]:
# Predict
games_tourney['pred'] = model.predict([games_tourney['team_1'], games_tourney['team_2'], games_tourney['home']])
In [ ]:
games_tourney.head()
In [ ]:
games_tourney.score_diff.unique()

Now you can try building a model for the tournament data based on your regular season predictions.

Create an input layer with multiple columns

In [ ]:
# Create an input layer with 3 columns
input_tensor = Input((3,))

# Pass it to a Dense layer with 1 unit
output_tensor = Dense(1)(input_tensor)

# Create a model
model = Model(input_tensor, output_tensor)

# Compile the model
model.compile(optimizer='adam', loss='mean_absolute_error')

Fit the model

In [ ]:
games_tourney_train = games_tourney.query('season < 2010')
In [ ]:
games_tourney_train.shape
In [ ]:
# Fit the model
model.fit(games_tourney_train[['home', 'seed_diff', 'pred']],
          games_tourney_train['score_diff'],
          epochs=1,
          verbose=True)

Evaluate the model

In [ ]:
games_tourney_test = games_tourney.query('season >= 2010')
In [ ]:
games_tourney_test.shape

Evaluate the model

In [ ]:
# Evaluate the model on the games_tourney_test dataset
model.evaluate(games_tourney_test[['home', 'seed_diff', 'pred']], 
               games_tourney_test['score_diff'])

Multiple Outputs

Simple two-output model

"multiple target regression": one model making more than one prediction.

In [ ]:
# Define the input
input_tensor = Input((2,))

# Define the output
output_tensor = Dense(2)(input_tensor)

# Create a model
model = Model(input_tensor, output_tensor)

# Compile the model
model.compile(optimizer='adam', loss='mean_absolute_error')

Fit a model with two outputs

This model will predict the scores of both teams.

In [ ]:
games_tourney_train.shape
In [ ]:
# Fit the model
model.fit(games_tourney_train[['seed_diff', 'pred']],
  		  games_tourney_train[['score_1', 'score_2']],
  		  verbose=False,
  		  epochs=1000,
  		  batch_size=64)
In [ ]:
import numpy as np
In [ ]:
np.mean(model.history.history['loss'])

Inspect the model (I)

The dense layer will have 4 kernel weights: 2 for each input times 2 for each output.

It will also have 2 bias weights, one for each output.
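One way to verify this count is to check the weight array shapes directly; a quick sketch using the model defined above:

In [ ]:
# get_weights() returns [kernel, bias] for the single dense layer
kernel, bias = model.get_weights()
print(kernel.shape)  # (2, 2): 2 inputs x 2 outputs = 4 weights
print(bias.shape)    # (2,): one bias per output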

In [ ]:
# Print the model's weights
print(model.get_weights())

# Print the column means of the training data
print(games_tourney_train.mean())

Did you notice that both output bias weights are about 53? This is because, on average, a team will score about 53 points in the tournament.

Evaluate the model

In [ ]:
# Evaluate the model on the tournament test data
model.evaluate(games_tourney_test[['seed_diff', 'pred']], games_tourney_test[['score_1', 'score_2']])

Classification and regression in one model

In [ ]:
# Create an input layer with 2 columns
input_tensor = Input((2,))

# Create the first output
output_tensor_1 = Dense(1, activation='linear', use_bias=False)(input_tensor)

# Create the second output (use the first output as input here)
output_tensor_2 = Dense(1, activation='sigmoid', use_bias=False)(output_tensor_1)

# Create a model with 2 outputs
model = Model(input_tensor, [output_tensor_1, output_tensor_2])

Compile and fit the model

In [ ]:
# Import the Adam optimizer
from keras.optimizers import Adam

# Compile the model with 2 losses and the Adam optimizer with a higher learning rate
model.compile(loss=['mean_absolute_error', 'binary_crossentropy'], optimizer=Adam(lr=0.01))

# Fit the model to the tournament training data, with 2 inputs and 2 outputs
model.fit(games_tourney_train[['seed_diff', 'pred']],
          [games_tourney_train[['score_diff']], games_tourney_train[['won']]],
          epochs=10,
          verbose=True,
          batch_size=16384)

Inspect the model (II)

In [ ]:
# Print the model weights
print(model.get_weights())

# Print the training data means
print(games_tourney_train.mean())
In [ ]:
# Import the sigmoid function from scipy
from scipy.special import expit as sigmoid

# Weight from the model
weight = 0.14

# Print the approximate win probability predicted for a close game (1-point margin)
print(sigmoid(1 * weight))

# Print the approximate win probability predicted for a blowout game (10-point margin)
print(sigmoid(10 * weight))

So sigmoid(1 * 0.14) is about 0.53, which represents a pretty close game, and sigmoid(10 * 0.14) is about 0.80, which represents a pretty likely win. In other words, if the model predicts a win by 1 point, it is less sure of the win than if it predicts a win by 10 points. Who says neural networks are black boxes?

Evaluate on new data with two metrics

Keras will return 3 numbers:

the first number will be the sum of the two losses,

the next 2 numbers will be the individual losses, one per output, in the order the outputs were defined.

In [ ]:
# Evaluate the model on new data
model.evaluate(games_tourney_test[['seed_diff', 'pred']],
               [games_tourney_test[['score_diff']], games_tourney_test[['won']]])
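Since the values come back in that order, you can unpack them into named variables; a minimal sketch:

In [ ]:
# Unpack: total loss, then one loss per output, in definition order
total_loss, score_diff_loss, won_loss = model.evaluate(
    games_tourney_test[['seed_diff', 'pred']],
    [games_tourney_test[['score_diff']], games_tourney_test[['won']]],
    verbose=0)
print(total_loss, score_diff_loss, won_loss)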

The model works as both a regressor and a classifier.


K Nearest Neighbours Iris Dataset

Data points without K Nearest Neighbour

In [ ]:
import matplotlib
#matplotlib.use('GTKAgg')

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.colors import ListedColormap
from sklearn import neighbors, datasets

# import some data to play with
iris = datasets.load_iris()

# take the first two features
# choosing first 2 columns sepal length and sepal width
X = iris.data[:, :2]
y = iris.target
h = .02  # step size in the mesh

# Calculate min, max and limits
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h), np.arange(y_min, y_max, h))

# Put the result into a color plot
plt.figure()
plt.scatter(X[:, 0], X[:, 1])
plt.xlim(xx.min(), xx.max())
plt.ylim(yy.min(), yy.max())
plt.title("Data points")
plt.show()

Sepal length and sepal width analysis

In [ ]:
import matplotlib

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.colors import ListedColormap
from sklearn import neighbors, datasets

n_neighbors = 6

# import some data to play with
iris = datasets.load_iris()

# prepare data
# choosing first 2 columns sepal length and sepal width
X = iris.data[:, :2]
y = iris.target
h = .02

# Create color maps
cmap_light = ListedColormap(['#FFAAAA', '#AAFFAA','#00AAFF'])
cmap_bold = ListedColormap(['#FF0000', '#00FF00','#00AAFF'])

# we create an instance of Neighbours Classifier and fit the data.
clf = neighbors.KNeighborsClassifier(n_neighbors, weights='distance')
clf.fit(X, y)

# calculate min, max and limits
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h),
                     np.arange(y_min, y_max, h))

# predict class using data and kNN classifier
Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])

# Put the result into a color plot
Z = Z.reshape(xx.shape)

plt.figure()
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=cmap_bold)
plt.xlim(xx.min(), xx.max())
plt.ylim(yy.min(), yy.max())

plt.figure()
plt.pcolormesh(xx, yy, Z, cmap=cmap_light)

# Plot also the training points
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=cmap_bold)
plt.xlim(xx.min(), xx.max())
plt.ylim(yy.min(), yy.max())
plt.title("3-Class classification (k = %i)" % (n_neighbors))
plt.show()

Petal length and petal width analysis

In [ ]:
import matplotlib

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.colors import ListedColormap
from sklearn import neighbors, datasets

n_neighbors = 6

# import some data to play with
iris = datasets.load_iris()

# prepare data
# choosing 3rd and 4th columns petal length and petal width
X = iris.data[:, [2,3]]
y = iris.target
h = .02

# Create color maps
cmap_light = ListedColormap(['#FFAAAA', '#AAFFAA','#00AAFF'])
cmap_bold = ListedColormap(['#FF0000', '#00FF00','#00AAFF'])

# we create an instance of Neighbours Classifier and fit the data.
clf = neighbors.KNeighborsClassifier(n_neighbors, weights='distance')
clf.fit(X, y)

# calculate min, max and limits
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h),
                     np.arange(y_min, y_max, h))

# predict class using data and kNN classifier
print(np.c_[xx.ravel(), yy.ravel()])

Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])
print(Z)

# Put the result into a color plot
Z = Z.reshape(xx.shape)
print(Z)

plt.figure()
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=cmap_bold)

plt.figure()
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=cmap_bold)
plt.xlim(xx.min(), xx.max())
plt.ylim(yy.min(), yy.max())


plt.figure()
plt.pcolormesh(xx, yy, Z, cmap=cmap_light)

# Plot also the training points
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=cmap_bold)
plt.xlim(xx.min(), xx.max())
plt.ylim(yy.min(), yy.max())
plt.title("3-Class classification (k = %i)" % (n_neighbors))
plt.show()

Predicting the output

In [ ]:
import numpy as np
from sklearn import neighbors, datasets
from sklearn import preprocessing

n_neighbors = 6

# import some data to play with
iris = datasets.load_iris()

# prepare data
# choosing first 2 columns sepal length and sepal width
X = iris.data[:, :2]
y = iris.target
h = .02

# we create an instance of Neighbours Classifier and fit the data.
clf = neighbors.KNeighborsClassifier(n_neighbors, weights='distance')
clf.fit(X, y)

# make prediction (uncomment to read the values interactively instead)
#sl = float(input('Enter sepal length (cm): '))
#sw = float(input('Enter sepal width (cm): '))

sl = 6.4
sw = 2.8
dataClass = clf.predict([[sl,sw]])
print('Prediction:', end=' ')

if dataClass == 0:
    print('Iris Setosa')
elif dataClass == 1:
    print('Iris Versicolour')
else:
    print('Iris Virginica')

K Nearest Neighbour With Multiple Features

In [ ]:
#Import scikit-learn dataset library
from sklearn import datasets

#Load dataset
wine = datasets.load_wine()
In [ ]:
# print the names of the features
print(wine.feature_names)
In [ ]:
# print the label species(class_0, class_1, class_2)
print(wine.target_names)
In [ ]:
# print the wine data (top 5 records)
print(wine.data[0:5])
In [ ]:
# print the wine labels (0:Class_0, 1:Class_1, 2:Class_2)
print(wine.target)
In [ ]:
# print data(feature)shape
print(wine.data.shape)
In [ ]:
# print target(or label)shape
print(wine.target.shape)
In [ ]:
# Import train_test_split function
from sklearn.model_selection import train_test_split

# Split dataset into training set and test set
X_train, X_test, y_train, y_test = train_test_split(wine.data, wine.target, test_size=0.3) # 70% training and 30% test

Model with K=5

In [ ]:
#Import knearest neighbors Classifier model
from sklearn.neighbors import KNeighborsClassifier

#Create KNN Classifier
knn = KNeighborsClassifier(n_neighbors=5)

#Train the model using the training sets
knn.fit(X_train, y_train)

#Predict the response for test dataset
y_pred = knn.predict(X_test)
In [ ]:
#Import scikit-learn metrics module for accuracy calculation
from sklearn import metrics
# Model Accuracy, how often is the classifier correct?
print("Accuracy:",metrics.accuracy_score(y_test, y_pred))

Model with K=7

In [ ]:
#Import knearest neighbors Classifier model
from sklearn.neighbors import KNeighborsClassifier

#Create KNN Classifier
knn = KNeighborsClassifier(n_neighbors=7)

#Train the model using the training sets
knn.fit(X_train, y_train)

#Predict the response for test dataset
y_pred = knn.predict(X_test)
In [ ]:
#Import scikit-learn metrics module for accuracy calculation
from sklearn import metrics
# Model Accuracy, how often is the classifier correct?
print("Accuracy:",metrics.accuracy_score(y_test, y_pred))

K Nearest Neighbour Play In Weather

K Nearest Neighbour with 2 features

In [ ]:
# Assigning features and label variables
# First Feature
weather=['Sunny','Sunny','Overcast','Rainy','Rainy','Rainy','Overcast','Sunny','Sunny',
'Rainy','Sunny','Overcast','Overcast','Rainy']
# Second Feature
temp=['Hot','Hot','Hot','Mild','Cool','Cool','Cool','Mild','Cool','Mild','Mild','Mild','Hot','Mild']

# Label or target variable
play=['No','No','Yes','Yes','Yes','No','Yes','No','Yes','Yes','Yes','Yes','Yes','No']
In [ ]:
# Import LabelEncoder
from sklearn import preprocessing
#creating labelEncoder
le = preprocessing.LabelEncoder()
# Converting string labels into numbers.
weather_encoded=le.fit_transform(weather)
print(weather_encoded)
In [ ]:
# converting string labels into numbers
temp_encoded=le.fit_transform(temp)
print(temp_encoded)
label=le.fit_transform(play)
print(label)
In [ ]:
# combining weather and temp into a single list of tuples
features=list(zip(weather_encoded,temp_encoded))
print(features)
In [ ]:
from sklearn.neighbors import KNeighborsClassifier

model = KNeighborsClassifier(n_neighbors=3)

# Train the model using the training sets
model.fit(features,label)

#Predict Output
predicted= model.predict([[0,2]]) # 0:Overcast, 2:Mild
print(predicted)

Classification And Regression Trees

Import Libs

In [ ]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib as mpl
import sys, os, scipy, sklearn
import sklearn.metrics, sklearn.preprocessing, sklearn.model_selection, sklearn.tree, sklearn.linear_model, sklearn.cluster
In [ ]:
mpl.rcParams['font.size'] = 14
pd.options.display.max_columns = 1000

Load Data

In [ ]:
data_folder = './'
data_files = os.listdir(data_folder)
display('Course files:',
        data_files)
for file_name in data_files:
    if '.csv' in file_name:
        globals()[file_name.replace('.csv','')] = pd.read_csv(data_folder+file_name, 
                                                              ).reset_index(drop=True)
        print(file_name)
        display(globals()[file_name.replace('.csv','')].head(), globals()[file_name.replace('.csv','')].shape)
In [ ]:
import os
print(os.listdir("../input"))
In [ ]:
wbc = pd.read_csv('../input/ninechapter-breastcancer/breastCancer.csv')
df = wbc
In [ ]:
label_encoder = sklearn.preprocessing.LabelEncoder()
label_encoder.fit(df['diagnosis'])
In [ ]:
X= df[['radius_mean', 'concave points_mean']]
y = label_encoder.transform(df['diagnosis'])

X_train, X_test, y_train, y_test = sklearn.model_selection.train_test_split(X, y, test_size=0.2)

Train your first classification tree

In this exercise you'll work with the Wisconsin Breast Cancer Dataset from the UCI machine learning repository. You'll predict whether a tumor is malignant or benign based on two features: the mean radius of the tumor (radius_mean) and its mean number of concave points (concave points_mean).

The dataset is already loaded in your workspace and is split into 80% train and 20% test. The feature matrices are assigned to X_train and X_test, while the arrays of labels are assigned to y_train and y_test where class 1 corresponds to a malignant tumor and class 0 corresponds to a benign tumor. To obtain reproducible results, we also defined a variable called SEED which is set to 1.

In [ ]:
SEED = 1
In [ ]:
# Import DecisionTreeClassifier from sklearn.tree
from sklearn.tree import DecisionTreeClassifier

# Instantiate a DecisionTreeClassifier 'dt' with a maximum depth of 6
dt = DecisionTreeClassifier(max_depth=6, random_state=SEED)

# Fit dt to the training set
dt.fit(X_train, y_train)

# Predict test set labels
y_pred = dt.predict(X_test)
print(y_pred[0:5])

Evaluate the classification tree

Now that you've fit your first classification tree, it's time to evaluate its performance on the test set. You'll do so using the accuracy metric which corresponds to the fraction of correct predictions made on the test set.

The trained model dt from the previous exercise is loaded in your workspace along with the test set features matrix X_test and the array of labels y_test.

In [ ]:
# Import accuracy_score
from sklearn.metrics import accuracy_score

# Predict test set labels
y_pred = dt.predict(X_test)

# Compute test set accuracy  
acc = accuracy_score(y_test, y_pred)
print("Test set accuracy: {:.2f}".format(acc))

Logistic regression vs classification tree

A classification tree divides the feature space into rectangular regions. In contrast, a linear model such as logistic regression produces only a single linear decision boundary dividing the feature space into two decision regions.

We have written a custom function called plot_labeled_decision_regions() that you can use to plot the decision regions of a list containing two trained classifiers. You can type help(plot_labeled_decision_regions) in the IPython shell to learn more about this function.

X_train, X_test, y_train, y_test, the model dt that you've trained in an earlier exercise, as well as the function plot_labeled_decision_regions() are available in your workspace.

In [ ]:
import mlxtend.plotting
In [ ]:
def plot_labeled_decision_regions(X_test, y_test, clfs):
    
    for clf in clfs:

        mlxtend.plotting.plot_decision_regions(np.array(X_test), np.array(y_test), clf=clf, legend=2)
        
        plt.ylim((0,0.2))

        # Adding axes annotations
        plt.xlabel(X_test.columns[0])
        plt.ylabel(X_test.columns[1])
        plt.title(str(clf).split('(')[0])
        plt.show()
In [ ]:
# Import LogisticRegression from sklearn.linear_model
from sklearn.linear_model import  LogisticRegression

# Instatiate logreg
logreg = LogisticRegression(random_state=1, solver='lbfgs')

# Fit logreg to the training set
logreg.fit(X_train, y_train)

# Define a list called clfs containing the two classifiers logreg and dt
clfs = [logreg, dt]

# Review the decision regions of the two classifiers
plot_labeled_decision_regions(X_test, y_test, clfs)

Classification Tree Learning

Information Gain: at each split, the tree chooses the feature and split point that maximize the information gain
$IG(f, sp) = I(\text{parent}) - \Big(\frac{N_{\text{left}}}{N}I(\text{left}) + \frac{N_{\text{right}}}{N}I(\text{right})\Big)$
where:

  • left and right refer to the left and right child nodes of the split
  • $f$ = feature
  • $sp$ = split point
  • $I(\text{node})$ = impurity of a node, measured with the:
    • gini index
    • entropy
    • etc.

Nodes are grown recursively. If $IG(\text{node}) = 0$ (and the tree's depth is not otherwise constrained), the node is declared a leaf. A concrete computation is sketched below.
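As a concrete illustration, here is a minimal sketch that computes the gini impurity and the information gain of one candidate split by hand (the toy label arrays are made up for the example):

In [ ]:
import numpy as np

def gini(labels):
    # impurity = 1 - sum of squared class proportions
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1 - np.sum(p ** 2)

parent = np.array([0, 0, 0, 0, 1, 1, 1, 1])
left = np.array([0, 0, 0, 0])    # samples with f <= sp
right = np.array([1, 1, 1, 1])   # samples with f > sp

ig = gini(parent) - (len(left) / len(parent) * gini(left)
                     + len(right) / len(parent) * gini(right))
print(ig)  # 0.5: this toy split separates the classes perfectly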

Using entropy as a criterion

In this exercise, you'll train a classification tree on the Wisconsin Breast Cancer dataset using entropy as the information criterion. You'll do so using all 30 features in the dataset, which is split into 80% train and 20% test.

X_train as well as the array of labels y_train are available in your workspace.

In [ ]:
# Import DecisionTreeClassifier from sklearn.tree
from sklearn.tree import DecisionTreeClassifier

# Instantiate dt_entropy, set 'entropy' as the information criterion
dt_entropy = DecisionTreeClassifier(max_depth=8, criterion='entropy', random_state=1)
dt_gini = DecisionTreeClassifier(max_depth=8, criterion='gini', random_state=1)


# Fit dt_entropy to the training set
dt_entropy.fit(X_train, y_train)
dt_gini.fit(X_train,y_train)

Entropy vs Gini index

In this exercise you'll compare the test set accuracy of dt_entropy to the accuracy of another tree named dt_gini. The tree dt_gini was trained on the same dataset using the same parameters except for the information criterion which was set to the gini index using the keyword 'gini'.

X_test, y_test, dt_entropy, as well as accuracy_gini which corresponds to the test set accuracy achieved by dt_gini are available in your workspace.

In [ ]:
# Import accuracy_score from sklearn.metrics
from sklearn.metrics import accuracy_score

# Use dt_entropy to predict test set labels
y_pred = dt_entropy.predict(X_test)

# Evaluate accuracy_entropy
accuracy_entropy = accuracy_score(y_test, y_pred)

y_pred = dt_gini.predict(X_test)

accuracy_gini = accuracy_score(y_test, y_pred)

# Print accuracy_entropy
print('Accuracy achieved by using entropy: ', accuracy_entropy)

# Print accuracy_gini
print('Accuracy achieved by using the gini index: ', accuracy_gini)

Train your first regression tree

In this exercise, you'll train a regression tree to predict the mpg (miles per gallon) consumption of cars in the auto-mpg dataset using all six available features.

The dataset is processed for you and is split into 80% train and 20% test. The features matrix X_train and the array y_train are available in your workspace.

In [ ]:
auto = pd.read_csv('../input/automobile/auto.csv')
df = auto
In [ ]:
X = df[['displ', 'hp', 'weight', 'accel', 'size', 'origin']]
y = df['mpg']
In [ ]:
onehot_encoder = sklearn.preprocessing.OneHotEncoder()
onehot_encodings = onehot_encoder.fit_transform(df[['origin']]).toarray()
onehot_encodings = pd.DataFrame(onehot_encodings,
                                columns=['origin_' + header for header in onehot_encoder.categories_[0]])

X = X.drop(columns='origin').reset_index(drop=True)
X = pd.concat((X, onehot_encodings), axis=1)
In [ ]:
X_train, X_test, y_train, y_test = sklearn.model_selection.train_test_split(X, y, test_size=0.2)
print(X_train.shape,y_train.shape)
In [ ]:
# Import DecisionTreeRegressor from sklearn.tree
from sklearn.tree import DecisionTreeRegressor

# Instantiate dt
dt = DecisionTreeRegressor(max_depth=8,
                           min_samples_leaf=0.13,
                           random_state=3)
lr = sklearn.linear_model.LinearRegression()

# Fit dt to the training set
dt.fit(X_train, y_train)
lr.fit(X_train,y_train)

Evaluate the regression tree

In this exercise, you will evaluate the test set performance of dt using the Root Mean Squared Error (RMSE) metric. The RMSE of a model measures, on average, how much the model's predictions differ from the actual labels. The RMSE of a model can be obtained by computing the square root of the model's Mean Squared Error (MSE).
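In symbols, for $N$ test samples with labels $y_i$ and predictions $\hat{y}_i$:

$RMSE = \sqrt{MSE} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(y_i - \hat{y}_i)^2}$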

The features matrix X_test, the array y_test, as well as the decision tree regressor dt that you trained in the previous exercise are available in your workspace.

In [ ]:
# Import mean_squared_error from sklearn.metrics as MSE
from sklearn.metrics import mean_squared_error as MSE

# Compute y_pred
y_pred = dt.predict(X_test)

# Compute mse_dt
mse_dt = MSE(y_test, y_pred)

# Compute rmse_dt
import numpy as np
rmse_dt = np.sqrt(mse_dt)

# Print rmse_dt
print("Test set RMSE of dt: {:.2f}".format(rmse_dt))

Linear regression vs regression tree

In this exercise, you'll compare the test set RMSE of dt to that achieved by a linear regression model. We have already instantiated a linear regression model lr and trained it on the same dataset as dt.

The features matrix X_test, the array of labels y_test, the trained linear regression model lr, mean_squared_error function which was imported under the alias MSE and rmse_dt from the previous exercise are available in your workspace.

In [ ]:
# Predict test set labels 
y_pred_lr = lr.predict(X_test)

# Compute mse_lr
mse_lr = MSE(y_test, y_pred_lr)

# Compute rmse_lr
import numpy as np
rmse_lr = np.sqrt(mse_lr)

# Print rmse_lr
print('Linear Regression test set RMSE: {:.2f}'.format(rmse_lr))

# Print rmse_dt
print('Regression Tree test set RMSE: {:.2f}'.format(rmse_dt))