Skip to content

Latest commit

 

History

History
1970 lines (1515 loc) · 69.2 KB

File metadata and controls

1970 lines (1515 loc) · 69.2 KB

BTrace Tutorial (BTrace 2.3.0)

Note: Examples use btrace.jar -- the single masked JAR (BTrace 2.2+). If using a legacy multi-JAR distribution, replace btrace.jar with btrace-agent.jar (and add -Xbootclasspath/a:btrace-boot.jar where needed).

Package namespace: BTrace 2.3+ uses the io.btrace package (previously org.openjdk.btrace). The compiler auto-injects import static io.btrace.BTrace.* into every script, so common helpers like println, str, monotonic, etc. are available without any import. Legacy probes compiled against org.openjdk.btrace are transparently migrated at load time.

Related documentation

Document What's in it
Getting Started Installation, first run
Quick Reference All annotations, parameters, built-in functions
Oneliner Guide DTrace-style one-liners (btrace -n '...')
Extension Development Guide Writing and shipping extensions
Extension Interface Rules API design contract that the build enforces
FAQ Common questions and answers
Troubleshooting Diagnosing agent and probe problems

1. Hello World

Accustoms the learner to 'btrace' command and the way it is used. Demonstrates the BTrace ability to instrument a class.

Setup

HelloWorld Class

package extra;

public abstract class HelloWorld extends HelloWorldBase {
    protected int field = 0;

    public static void main(String[] args) throws Exception {
        System.out.println("ready when you are ...");
        System.in.read();

        callA();
    }

    private static void callA() {
        HelloWorld instance = new HelloWorldExt();
        long x = System.nanoTime();
        instance.callA("hello", 13);
        System.out.println("dur = " + (System.nanoTime() - x));
    }

    private void callA(String a, int b) {
        field++;
        callB(callC(a, b));
        field--;
    }

    private void callB(String s) {
        field++;
        System.out.println("You want " + s);
        field--;
    }

    protected abstract String callC(String a, int b);
}

final class HelloWorldExt extends HelloWorld {
    @Override
    protected String callC(String a, int b) {
        try {
            field++;
            String s = a + "-" + b;
            for (int i = 0; i < 100; i++) {
                s = callD(s);
            }
            return s;
        } finally {
            field--;
        }
    }
}

abstract class HelloWorldBase {
    protected final String callD(String s) {
        return "# " + s;
    }
}

Steps

  1. Run the HelloWorld application
  2. Get the HelloWorld application PID via jps command
  3. Run btrace <PID> <HelloWorldTrace.java>
  4. Proceed with the HelloWorld application
  5. Watch messages being printed

You will repeat these steps while gradually enhancing the used BTrace script

Lessons

Lesson 1 - Launching BTrace

Using btrace client to attach to a running JVM

btrace [opts] <pid> <btrace-script> [<args>]

  • opts BTrace specific options; use btrace -h to obtain the list of all supported options
  • pid process id of the traced Java program
  • btrace-script trace program. If it is a ".java", then it is compiled before submission. Or else, it is assumed to be pre-compiled [i.e., it has to be a .class] and submitted.
  • args trace specific arguments

Once you are attached to the target JVM you can press Ctrl-C in the terminal to show the BTrace console. From there you can either detach and exit or send an event (handled by the @OnEvent annotated methods in the trace program).

Starting a Java application with BTrace agent
Directly

java -javaagent:btrace.jar=[<agent-arg>[,<agent-arg>]*]? <launch-args>

The agent takes a list of comma separated arguments.

  • noServer - don't start the socket server
  • bootClassPath - boot classpath to be used
  • systemClassPath - system classpath to be used
  • debug - turns on verbose debug messages (true/false)
  • trusted - do not check for btrace restrictions violations (true/false)
  • dumpClasses - dump the transformed bytecode to files (true/false)
  • dumpDir - specifies the folder where the transformed classes will be dumped to
  • stdout - redirect the btrace output to stdout instead of writing it to an arbitrary file (true/false)
  • probeDescPath - the path to search for probe descriptor XMLs
  • startupRetransform - enable retransform of all the loaded classes at attach (true/false)
  • scriptdir - the path to a directory containing scripts to be run at the agent startup
  • scriptOutputFile - the path to a file the btrace agent will store its output
  • grant - comma-separated list of permissions to grant (e.g. grant=NETWORK,THREADS)
  • deny - comma-separated list of permissions to deny (e.g. deny=EXEC,NATIVE)
  • grantAll - grant all permissions (true/false) - use with caution
  • script - colon separated list of tracing scripts to be run at the agent startup

The scripts to be run must have already been compiled to bytecode (a .class file) by btracec.

Using `btracer'

btracer [opts] <pre-compiled-btrace.class> <vm-arg> <application-args>

  • opts BTrace specific options; use btracer -h to obtain the list of all supported options
  • pre-compiled-btrace.class the trace script compiled to bytecode via btracec
  • vm-args the VM arguments; eg. -cp app.jar Main.class or -jar app.jar
  • application-args the application specific arguments

You can use btracer to launch java application from jar (btracer ... -jar app.jar <application args>) or a main class (btracer ... -cp <class_path> <main class> <application args>)

Compiling trace scripts

This needs to be done in order to launch the Java application with BTrace agent.

btracec [-cp <classpath>] [-d <directory>] <one-or-more-BTrace-.java-files>

  • classpath is the classpath used for compiling BTrace program(s). Default is "."
  • directory is the output directory where compiled .class files are stored. Default is ".".

Rather than regular javac the BTrace compiler is used - causing the script to be validated at compile time and prevent reporting verify errors at runtime.

Inspecting the compiled trace scripts

BTrace compiler will, by default, create a binary trace representation which packages the trace class file together with some metadata designed to make the trace loading and application faster. These packages are not directly readable by tools like javap and one must use btracep instead. The syntax is straightforward - ./btracep <binary trace file>. The tool will print out

  • trace name
  • verification status (trusted or not)
  • transformation status (will cause class retransformation or not)
  • all probe handlers (all @OnProbe, @OnTimer etc. definitions)
  • ASM-ified version of the associated "data holder" class - the class contains the information that needs to be globally accessible from instrumented code

Lesson 2 - Tracing methods

This is the main purpose of BTrace - inject a custom code to custom locations to give the insights about the internal state and dynamics of the application.

Import convention: Code examples in this lesson use import ...; as a shorthand. In practice you need no explicit imports at all: the compiler auto-injects both import static io.btrace.BTrace.*; (utility methods like println, str, monotonic) and import io.btrace.core.annotations.*; (all BTrace annotations). The import ...; placeholder is kept in these examples only to signal which packages are in use. See Lesson 7 for details.

  1. Getting just the information that any method is being executed
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/.*/")
    public static void onMethod() {
        println("Hello from method");
    }
}
  1. Get the method names
package helloworld;
import ...;

@BTrace 
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld")
    public static void onMethod(@ProbeMethodName String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept only a particular method
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callA")
    public static void onMethod(@ProbeMethodName String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept only a particular method with name matching the handler name
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="#")
    public static void callA(@ProbeMethodName String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept methods with names matching certain patterns Note: you can use pattern matching for the class names, too
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/")
    public static void onMethod(@ProbeMethodName String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept methods with names matching certain patterns and inspect their parameters
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/")
    public static void onMethod(@ProbeMethodName String pmn, AnyType[] args) {
        println("Hello from method " + pmn);
        println("Received the following parameters:");
        printArray(args);
    }
}
  1. Intercept method with names matching certain patterns and discover their signatures
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/")
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept methods for all subclasses and implementations of a certain class/interface

Note: 'extra.HelloWorldBase.callD()' doesn't show up - it is defined in the superclass of 'extra.HelloWorld' and therefore not intercepted.

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="+extra.HelloWorld", method="/call.*/")
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept method with a particular name and signature + capture the method arguments (you need to use the information learned in the previous step)

Note: The order of the un-annotated parameters must correspond to the order of the traced method parameters. Annotated parameters may be placed anywhere.

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorldExt", method="callC")
    public static void onMethod(@ProbeMethodName String pmn, String param1, int param2) {
        println("Hello from method " + pmn);
        println("Arguments: param1 = " + str(param1) + ", param2 = " + str(param2));
    }
}
  1. Intercept method with a particular name and signature but don't capture the method arguments. Here you will need to decifer the VM method signature to get java like method signature. See the @OnMethod.type() javadoc for the java like signature format.

Eg. having the VM method signature in form of (Ljava/lang/String;I)V will translate to "void (java.lang.String, int)"

Note: We are using overloaded method here and specifying the signature helps BTrace determine which method should be instrumented

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callA", type="void (java.lang.String, int)")
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn) {
        println("Hello from method " + pmn);
    }
}
  1. Intercept method with a particular name and capture its return value location=@Location(Kind.RETURN) sets up the instrumentation to be inserted just before the method exits
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callC", location=@Location(Kind.RETURN))
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @Return String ret) {
        println("Hello from method " + pmn + "; returning " + ret);
    }
}
  1. Inspect the content of an instance variable in the method declaring class
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/", location=@Location(Kind.RETURN))
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @Self Object thiz) {
        println("Hello from method " + pmn);
        println("field = " + str(getInt("field", thiz)));
    }
}

Or retrieve the java.lang.Field instance first and perform a check before trying to retrieve the field value.

package helloworld;
import java.lang.Class;
import java.lang.reflect.Field;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/", location=@Location(Kind.RETURN))
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @Self Object thiz) {
        Class myClz = classOf(thiz);
        Field fld = field(myClz, "field", false);
        println("Hello from method " + pmn);
        if (fld != null) {
           println("field = " + str(getInt(fld, thiz)));
        }
    }
}
  1. Get the method execution duration

Note: Need to use @Location(Kind.RETURN) to be able to capture the execution duration

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/", location=@Location(Kind.RETURN))
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @Duration long dur) {
        println("Hello from method " + pmn);
        println("It took " + str(dur) + "ns to execute this method");
    }
}
  1. Tracing methods invoked from inside a particular method

Note: 'class', 'method' etc. directly in the @OnMethod annotation will determine where we should look for the invocation of the methods defined by 'class', 'method' etc. parameters in the @Location annotation.

Note: @ProbeMethodName and @ProbeClassName refer to the context method and class; @TargetMethodOrField refers to the traced method invocation

Note: You can use the 'type' annotation parameter in @OnMethod annotation to restrict the context methods and in @Location to restrict the traced method invocations

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callA",
              location=@Location(
                    value = Kind.CALL,
                    clazz = "extra.HelloWorld",
                    method = "/call.*/",
                    where = Where.BEFORE)
    )
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @TargetMethodOrField(fqn = true) String tpmn) {
        println("Hello from method " + pmn);
        println("Going to invoke method " + tpmn);
    }
}
  1. Measuring the duration of methods invoked from inside a particular method
package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callA",
              location=@Location(
                    value = Kind.CALL,
                    clazz = "extra.HelloWorld",
                    method = "/call.*/",
                    where = Where.AFTER)
    )
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @TargetMethodOrField(fqn = true) String tpmn, @Duration long dur) {
        println("Hello from method " + pmn);
        println("Executing " + tpmn + " took " + dur + "ns");
    }
}
  1. Tracing methods invoked from inside a particular method and capturing their parameters

Note: The captured parameters pertain to the invoked method rather than the context method

Note: The @Self annotated parameter captures the context instance and @TargetInstance annotated parameter captures the instance the method is invoked on

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="callA",
              location=@Location(
                    value = Kind.CALL,
                    clazz = "extra.HelloWorld",
                    method = "/call.*/",
                    where = Where.BEFORE)
    )
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn, @TargetMethodOrField(fqn = true) String tpmn,
                                @Self Object thiz, @TargetInstance Object tgt, String a, int b) {
        println("Hello from method " + pmn);
        println("Going to invoke method " + tpmn);
        println("context = " + str(classOf(thiz)) + ", target = " + str(classOf(tgt)));
        println("a = " + a + ", b = " + str(b));
    }
}

Lesson 3 - Global callbacks

Global callbacks are not directly related to the tracing code injection but they allow us to observe the global state and act correspondingly.

Import convention: Same as Lesson 2 — both annotation and utility imports are auto-injected; no explicit imports are required.

@OnExit

Called when the traced application is about to exit. Allows to capture the exit code.

Note: The signature of the handler method MUST be 'void (int)'

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnExit
    public static void onexit(int code) {
        println("Application exitting with " + code);
    }
}

@OnError

Called whenever an exception is thrown from anywhere in the BTrace handlers.

Note: The signature of the handler method MUST be 'void (java.lang.Throwable)'

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnError
    public static void onerror(Throwable t) {
        println("Encountered internal error " + str(t));
    }
}

@OnTimer

Allows to register a handler to be invoked periodically at defined intervals.

Note: The annotation parameter takes the interval value in milliseconds

Note: The signature of the handler method MUST be 'void ()'

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnTimer(1000)
    public static void ontimer() {
        println("tick ...");
    }
}

@OnEvent

Used to raise events from external clients (eg. the command line client). The annotation takes a String parameter which is the event name. When not provided the event is considered to be 'unnamed'.

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnEvent
    public static void unnamed() {
        println("Received unnamed event");
    }
}

or

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    @OnEvent("myEvent")
    public static void myevent() {
        println("Received my event");
    }
}

Lesson 4 - Sampling

Tracing many methods being executed frequently can bring a significant overhead to the traced application. And often we are not really interested in the high detail data - an aggregated view would do just fine.

Therefore it is possible to employ statistical sampling to reduce the amount of collected data and related overhead while still providing relevant information about the application behaviour.

The sampling implementation in BTrace guarantees that at least one invocation of a traced method will be recorded, no matter what the sampling settings are.

  1. Intercept only each 10th invocation on average

Note: Even though the 'callD' method is executed 100 times we will get only ~10 hits - as dictated by the 'mean' parameter.

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    private static int cntr = 1;
    @Sampled(kind = Sampled.Sampler.Const, mean = 10)
    @OnMethod(clazz="/extra\\.HelloWorld.*/", method="callD")
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn) {
        println("Hello from method " + pmn + " : " + (cntr++));
    }
}
  1. Let the sampler mean parameter be adjusted dynamically by keeping to the overhead target

Note: In this case the 'mean' parameter actually specifies the lowest number of nanoseconds the average interval between interceptions should be

Note: Because the adaptive sampling needs to collect timestamps in order to maintain the overhead target the lowest value for the 'mean' parameter is 180 (cca. 60ns for getting start/stop timestamp pair multiplied by the safety margin of 3)

Note: The 'callD' method is very short and the number of iteration is rather limited - we will most probably get only one hit here

package helloworld;
import ...;

@BTrace
public class HelloWorldTrace {
    private static int cntr = 1;
    @Sampled(kind = Sampled.Sampler.Adaptive, mean = 50)
    @OnMethod(clazz="/extra\\.HelloWorld.*/", method="callD")
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn) {
        println("Hello from method " + pmn + " : " + (cntr++));
    }
}

Lesson 5 - JFR events

Since BTrace 2.1.0 it is possible to define and use JFR dynamic events directly from the BTrace scripts. This gives immediate access to the high-performance event recording engine built directly in JVM. Being able to observe the script defined events in the bigger context of full application/JVM is an additional benefit when comparing to the 'standard' BTrace way of writing the information to stdout or a dedicated text file.

Define and use a regular JFR event

A new dynamic event type in BTrace is defined via a JfrEvent.Factory instance configured by @Event aannotation. The annotation defines the event metadata and event fields.

Event Metadata
  • name - global event name
  • label - a pretty printed event name
  • description - long description
  • category - arbitrary category in 'directory' format (eg. 'Application/SQL/Updates')
  • stacktrace - whether the event should collect stacktrace or not
Field definition

The fields argument of the @Event annotation defines the array of event fields where each field is defined with the help of @Event.Field annotation. The fields can only be of a supported type (Java primitives + String, Class and Thread) and don't support arrays/lists.

Using the event

BTraceUtils has been enhanced with the following methods to support working with JFR events:

  • prepareEvent(eventFactory) - prepares a new event using the given factory
  • begin(event) - calls begin() event method to start measuring the event time span
  • end(event) - calls end() event method to stop measuring the event time span
  • setEventField(event, field, value) - sets a field value
  • commit(event) - tries and commits the event
Example
@BTrace public class JfrEventsProbe {
    @Event(
        name = "CustomEvent",
        label = "Custom Event",
        fields = {
            @Event.Field(type = Event.FieldType.INT, name = "a"),
            @Event.Field(type = Event.FieldType.STRING, name = "b")
        }
    )
    private static JfrEvent.Factory customEventFactory;

    @OnMethod(clazz = "/.*/", method = "/.*/")
    public static void onMethod() {
        JfrEvent event = prepareEvent(customEventFactory);
        setEventField(event, "a", 10);
        setEventField(event, "b", "hello");
        commit(event);
    }
}
Define and use a periodic JFR event

A periodic JFR event is automatically generated by JFR at a given time interval or at beginning/end of a JFR chunk.

Periodic event handler

A periodic event is defined by an event handler - it is a method by @PeriodEvent annotation, pretty much like @OnMethod or @OnTimer handlers. Similarly to the regular events the annotation defines the event metadata and the event fields. In addition to that the annotation defines the period which can be a time unit like 10 s or 100 ms or it can be one of everyChunk, beginChunk or endChunk.

The handler method takes one parameter of JfrEvent type. The value of this parameter can be used in the BTraceUtils JFR specific methods.

Example
@BTrace public class JfrEventsProbe {
    @PeriodicEvent(name = "PeriodicEvent", fields = @Event.Field(type = Event.FieldType.INT, name = "ts", kind = @Event.Field.Kind(name = Event.FieldKind.TIMESTAMP)), period = "1 s")
    public static void onPeriod(JfrEvent event) {
        if (shouldCommit(event)) {
            setEventField(event, "ts", 1);
            commit(event);
        }
    }
}

Lesson 6 - Extensions and Permissions

BTrace supports extensions that provide additional functionality beyond the core tracing capabilities. Extensions can send metrics to external systems, integrate with DTrace, and more. To ensure safety, extensions require explicit permissions.

See the BTrace Extension Development Guide for the full guide on building and deploying extensions, and the Extension Interface Rules for the API design contract.

Permission System

BTrace uses a permission-based security model organized into three tiers:

Default Permissions (always granted)

These permissions are safe and always available:

  • MESSAGING - Send messages to BTrace client
  • AGGREGATION - Use aggregation functions
  • JFR_EVENTS - Create and use JFR events
  • PROFILING - Use profiling functions
Standard Permissions (granted unless explicitly denied)

These permissions allow read-only access to system information:

  • FILE_READ - Read files from disk (limited paths)
  • SYSTEM_PROPS - Read system properties
  • THREAD_INFO - Read thread information
  • MEMORY_INFO - Read memory and GC information
Privileged Permissions (require explicit grant)

These permissions have security implications and must be explicitly granted:

  • FILE_WRITE - Write files to disk
  • NETWORK - Network I/O (sockets, HTTP)
  • THREADS - Create and manage threads
  • NATIVE - Call native code (JNI, Unsafe)
  • EXEC - Execute external processes
  • REFLECTION - Use reflection
  • CLASSLOADER - Access classloaders
  • UNLIMITED_MEMORY - Unlimited buffer allocation
Permission Model

Permissions are defined by extension/service descriptors and enforced via agent grants. The Gradle plugin writes the effective permission set into the extension’s manifest.

Do Not Instantiate Types in Probes
  • Probes do not construct arbitrary objects (new is not allowed). Instead, obtain builders or factories from injected services and pass built configuration handles back to the service.
  • This keeps hot paths allocation-free and within verifier constraints (only BTraceUtils, injected services, or objects returned from services can be used).

Example:

@BTrace
class Example {
  @Injected static MetricsService metrics;
  static final HistogramConfig cfg;
  static {
    cfg = metrics.newHistogramConfig().unit("ns").build();
  }
  @OnMethod(clazz="com.example.Foo", method="bar", location=@Location(Kind.RETURN))
  static void onReturn(@Duration long d) {
    metrics.histogram("foo.bar.latency", cfg).record(d);
  }
}
Granting Permissions at Runtime

When running a probe that requires privileged permissions, you must explicitly grant them:

Using the btrace client
btrace --grant=NETWORK,THREADS <pid> MetricsProbe.class
Using the Java agent
java -javaagent:btrace.jar=script=MetricsProbe.class,grant=NETWORK,THREADS ...
Grant all permissions (use with caution)
btrace --grantAll=true <pid> MetricsProbe.class

or

java -javaagent:btrace.jar=script=MetricsProbe.class,grantAll=true ...
Per-Extension Allow/Deny (Simplified Policy)
  • Allow specific extensions to link implementations via agent args:
    • -javaagent:btrace.jar=...,allowExtensions=btrace-statsd,my-metrics
  • Deny specific extensions (implementation blocked; SHIM fallback):
    • -javaagent:btrace.jar=...,denyExtensions=legacy-foo
  • Allow all privileged extensions:
    • -javaagent:btrace.jar=...,allowPrivileged=true
  • Optional process-local policy file:
    • -Dbtrace.permissions=/path/to/permissions.properties or ~/.btrace/permissions.properties
    • Example content:
      • allowExtensions=btrace-statsd
      • denyExtensions=legacy-foo
      • allowPrivileged=false
  • Extensions continue to expose required permissions in logs to help operators decide whether to allow them.
Permission Error Messages

If a probe requires permissions that are not granted, BTrace will display a descriptive error message:

Probe requires permissions that are not granted:

  - NETWORK
    Network I/O (sockets, HTTP). Risk: Data exfiltration, remote connections.
  - THREADS
    Create and manage threads. Risk: Resource exhaustion, concurrent operations.

To allow these permissions, use:
  --grant=NETWORK,THREADS

Or use --grantAll=true to allow all permissions (not recommended).
Inspecting Probe Permissions

Use the btracep tool to inspect what permissions a compiled probe requires:

btracep MetricsProbe.class

The output will include a "Required permissions" line listing all permissions the probe needs.

Using the StatsdExtension

The StatsdExtension allows sending metrics to a StatsD server:

@BTrace
public class StatsdExample {
    @Injected
    private static StatsdExtension statsd;

    @OnMethod(clazz = "com.example.API", method = "handleRequest",
              location = @Location(Kind.RETURN))
    public static void onRequest(@Duration long duration) {
        statsd.increment("api.requests");
        statsd.timing("api.latency", duration / 1_000_000);
    }
}

Run with:

btrace --grant=NETWORK,THREADS -statsd localhost:8125 <pid> StatsdExample.class
Using the Histogram Metrics Extension (btrace-metrics)

The histogram metrics extension provides high-performance in-process metrics using HdrHistogram. It does not require network permissions and runs entirely inside the target JVM.

Requirements:

  • Build the distribution so extensions are exploded under BTRACE_HOME/extensions/.
  • The agent discovers built-in extensions automatically; no additional flags are needed.

Example:

package myprobes;

import io.btrace.core.annotations.*;
import io.btrace.metrics.MetricsService;
import io.btrace.metrics.histogram.HistogramConfig;
import io.btrace.metrics.histogram.HistogramMetric;
import io.btrace.metrics.histogram.HistogramSnapshot;
import io.btrace.metrics.stats.StatsMetric;
import io.btrace.metrics.stats.StatsSnapshot;

@BTrace
public class HistogramExample {
    // ServiceType hint is optional; omit for defaults
    @Injected
    private static MetricsService metrics;

    private static HistogramMetric histogram;
    private static StatsMetric stats;

    @OnMethod(clazz = "com.example.Service", method = "doWork")
    public static void onEntry() {
        if (histogram == null) {
            histogram = metrics.histogramMicros("service.doWork");
            stats = metrics.stats("service.doWork.stats");
        }
    }

    @OnMethod(clazz = "com.example.Service", method = "doWork", location = @Location(Kind.RETURN))
    public static void onReturn(@Duration long durationNanos) {
        long durationMicros = durationNanos / 1000;
        histogram.record(durationMicros);
        stats.record(durationMicros);
    }

    @OnTimer(1000)
    public static void onTimer() {
        HistogramSnapshot h = histogram.snapshot();
        StatsSnapshot s = stats.snapshot();

        println("=== Metrics Report ===");
        println("Count: " + s.count());
        println("Mean: " + s.mean() + " μs");
        println("Min: " + s.min() + " μs");
        println("Max: " + s.max() + " μs");
        println("P50: " + h.p50() + " μs");
        println("P95: " + h.p95() + " μs");
        println("P99: " + h.p99() + " μs");
        println("======================");
    }
}

Run with:

btrace <pid> HistogramExample.java

You should see a periodic metrics report similar to:

=== Metrics Report ===
Count: 4
Mean: 4178.5 μs
Min: 118 μs
Max: 16341 μs
P50: 120 μs
P95: 16343 μs
P99: 16343 μs
======================

Configuration (optional):

  • You can tune defaults in btrace.conf (see Architecture: Extension Configuration):
    • btrace-metrics.histogram.default-precision=3
    • btrace-metrics.histogram.max-value=3600000000
Building App Integration Extensions with @ExternalType

When an extension needs to interact with application-specific types (Spark events, Hadoop objects, custom framework classes), use the @ExternalType annotation to eliminate manual reflective boilerplate. The BTrace extension plugin auto-registers the annotation processor.

Declare an interface in src/main/java annotated with @ExternalType("fully.qualified.AppType"):

// src/main/java/org/example/spark/api/JobStartEvent.java
@ExternalType("org.apache.spark.scheduler.SparkListenerJobStart")
public interface JobStartEvent {
    int jobId();
    long time();
}

The processor generates JobStartEvent$Ext with lazy, cached MethodHandle dispatchers. Use them directly in the impl — no try/catch or cache setup required:

// src/main/java/org/example/spark/impl/SparkApiImpl.java
public final class SparkApiImpl extends Extension implements SparkApi {
    @Override
    public void onJobStart(Object event) {
        int  id = JobStartEvent$Ext.jobId(event);
        long ts = JobStartEvent$Ext.time(event);
        // emit metrics / logs...
    }
}

Resolution happens lazily on the first call via the event object's own class loader; the handle is cached for subsequent calls. If the external class isn't loaded yet, the resolver retries next call — no ExceptionInInitializerError at extension startup.

Field access, constructors, non-public methods, and chained external types are not yet handled by the processor — they are planned for a future @ExternalType version. Use ClassLoadingUtil / MethodHandleCache directly in the meantime (see Provided-Style Extensions for the full scope-limits table and workarounds).

For the full @ExternalType reference, see BTrace Extension Development Guide.

Zero-Config Probe Auto-Selection (ExtensionConfigurator)

For fat agent deployments (Spark executors, Hadoop nodes, Kubernetes pods), extensions can automatically select the right bundled probes without operator input.

1. List probe names and declare the configurator in extension.properties:

probes=DriverTracer,ExecutorTracer
configurator=org.example.spark.SparkConfigurator

2. Stage the compiled probe .class files in the fat agent plugin:

btraceFatAgent {
    embedExtensions { project(':my-spark-extension') }
    bundledProbes { from 'src/probes/compiled' }
}

3. Implement ExtensionConfigurator:

public final class SparkConfigurator implements ExtensionConfigurator {
    @Override
    public ProbeConfiguration configure(RuntimeEnvironment env, Map<String, String> args) {
        ProbeConfiguration config = new ProbeConfiguration();
        if (env.hasClass("org.apache.spark.SparkContext")) {
            config.enable("DriverTracer");
        } else if (env.hasClass("org.apache.spark.executor.Executor")) {
            config.enable("ExecutorTracer");
        }
        return config;
    }
}

RuntimeEnvironment provides hasClass(String), getSystemProperty(String), getEnv(String), and getMainClassName() for environment detection.

4. Operator attaches — no probes= argument needed:

java -javaagent:my-btrace-agent-fat.jar spark-submit ...

If the operator supplies probes=, it takes priority and the configurator is skipped. See Extension Development Guide — Bundled Probes for the full API.

Troubleshooting Failed Extensions

If extensions fail to load during agent initialization (for example, due to missing dependencies or configuration issues), BTrace will display a warning when you submit a probe:

[BTRACE WARN] 1 extension(s) failed to load:
  - StatsdExtension: Missing manifest metadata (ensure Gradle plugin is applied and configured)
Use 'btrace -le <PID>' for details.
Listing Failed Extensions

Use the -le option to see detailed information about failed extensions:

btrace -le <pid>

This will display all extensions that failed to load and the reasons for their failures:

Failed Extensions:
  1. io.btrace.statsd.StatsdExtension: Connection refused to localhost:8125
  2. io.btrace.dtrace.DTraceExtension: DTrace not available on this platform
Interactive Menu

When attached to a JVM in interactive mode (press Ctrl-C), you can also select option 7 to list failed extensions:

Please enter your option:
        1. exit
        2. send an event
        3. send a named event
        4. flush console output
        5. list probes
        6. detach client
        7. list failed extensions

This is useful for diagnosing issues when probes that rely on specific extensions are not working as expected.

Writing Your Own Extension (Quick Start)

Extensions are standalone Gradle projects that expose a typed service interface to BTrace scripts. The full workflow is covered in the BTrace Extension Development Guide; the steps below are the minimum to get started.

1. Apply the BTrace extension Gradle plugin

plugins {
    id("io.btrace.extension") version "<btraceVersion>"
}

btraceExtension {
    id = "com.example.myext"
    services = ["com.example.myext.api.MyService"]
}

2. Define the service API (what scripts see)

// src/main/java/com/example/myext/api/MyService.java
package com.example.myext.api;

public interface MyService {
    void record(String key, long value);
}

3. Implement the service (stays isolated from scripts)

// src/main/java/com/example/myext/impl/MyServiceImpl.java
package com.example.myext.impl;

import com.example.myext.api.MyService;
import io.btrace.ext.spi.Extension;

public class MyServiceImpl extends Extension implements MyService {
    @Override
    public void record(String key, long value) {
        // emit metric, write to log, etc.
    }
}

4. Inject and use the service in a script

import io.btrace.core.annotations.*;
import com.example.myext.api.MyService;

@BTrace
public class MyProbe {
    @Injected
    private static MyService myService;

    @OnMethod(clazz="com.example.App", method="processRequest",
              location=@Location(Kind.RETURN))
    public static void onReturn(@Duration long d) {
        myService.record("app.processRequest", d);
    }
}

The plugin generates the service descriptor manifest, computes the required permission set, and produces a distributable ZIP that installs under $BTRACE_HOME/extensions/. See the BTrace Extension Development Guide for detailed coverage of: classloader isolation, permission declarations, @ExternalType adapters, fat-agent embedding, and the full API design rules documented in ExtensionInterfaceRules.md.


Lesson 7 - Flat DSL

BTrace 2.3+ ships a flat DSL class io.btrace.BTrace that exposes the most common helper operations as plain static methods. The compiler automatically injects two imports into every script:

  • import static io.btrace.BTrace.*; — all flat DSL helpers (println, str, monotonic, …)
  • import io.btrace.core.annotations.*; — all BTrace annotations (@BTrace, @OnMethod, @TLS, @Level, …)

This means a minimal BTrace script requires zero import statements — just annotate your class and write the probe logic.

What the flat DSL provides
Category Methods
Output print(s), println(s), println(), printf(fmt, args...)
Strings str(o), concat(a,b), substr(s,start,end), matches(s,regex), startsWith(s,prefix), endsWith(s,suffix), length(s)
Numbers abs(l/d), min(a,b), max(a,b)
Time timestamp() (wall clock ms), monotonic() (nanos)
Thread currentThread(), threadName(t), threadId(t)
Stack stackTrace(), printStack(), stackDepth()
Object className(o), identity(o), size(o)
Control exit(code)

For everything else (aggregations, reflection, profiling, JFR, data structures, thread operations) use BTraceUtils or an injected extension service.

Writing scripts without explicit imports

Before (classic style):

import io.btrace.core.BTraceUtils;
import io.btrace.core.annotations.*;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/.*/")
    public static void onMethod(@ProbeMethodName String pmn) {
        BTraceUtils.println("Hello from method " + pmn);
    }
}

After (modern zero-import style — both imports auto-injected by the compiler):

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/.*/")
    public static void onMethod(@ProbeMethodName String pmn) {
        println("Hello from method " + pmn);
    }
}

You can still write the imports explicitly if you prefer (for IDE auto-complete, for example) — the compiler skips injection when they are already present:

import static io.btrace.BTrace.*;
import io.btrace.core.annotations.*;

@BTrace
public class HelloWorldTrace { ... }
Mixing flat DSL with BTraceUtils

The flat DSL covers the most frequent operations. When you need something it does not offer, import BTraceUtils directly alongside it:

import io.btrace.core.BTraceUtils;
import io.btrace.core.annotations.*;

@BTrace
public class HelloWorldTrace {
    @OnMethod(clazz="extra.HelloWorld", method="/call.*/", location=@Location(Kind.RETURN))
    public static void onMethod(@ProbeMethodName(fqn = true) String pmn,
                                @Self Object thiz) {
        // println/str come from the flat DSL (auto-injected)
        println("Hello from method " + pmn);
        // getInt/classOf come from BTraceUtils (explicit import)
        println("field = " + str(BTraceUtils.getInt("field", thiz)));
    }
}
How it works under the hood

At compile time, every INVOKESTATIC targeting io.btrace.BTrace.* is rewritten by the post-processor to an INVOKEDYNAMIC instruction that routes the call through BTraceBootstrap. This gives probe code stronger isolation from the target application's classloader hierarchy without any visible change in the script source.


Lesson 8 - Thread-Local Storage

When a probe spans two separate handler methods — for example, recording a start time at method entry and computing duration at method return — you need per-thread state. BTrace provides the @TLS annotation for exactly this purpose: a field marked @TLS behaves like a ThreadLocal, with each thread seeing its own independent copy.

Note: Fields annotated with @TLS must be static. Primitive types and String are the most reliable choices; object references work too but require care because the field is null-initialized per thread on first access.

Measuring per-thread duration
import io.btrace.core.annotations.*;

@BTrace
public class HelloWorldTrace {
    @TLS
    private static long startTime;   // one copy per thread

    @OnMethod(clazz="extra.HelloWorld", method="/call.*/")
    public static void onEntry(@ProbeMethodName(fqn = true) String pmn) {
        startTime = monotonic();     // record entry time for this thread
        println("Entering " + pmn);
    }

    @OnMethod(clazz="extra.HelloWorld", method="/call.*/",
              location=@Location(Kind.RETURN))
    public static void onReturn(@ProbeMethodName(fqn = true) String pmn) {
        long dur = monotonic() - startTime;
        println(pmn + " took " + str(dur) + " ns");
    }
}

Note: @TLS is reset to its zero/null value when the thread exits, so it is safe under thread pools where threads are reused across requests.


Lesson 9 - Instrumentation Levels

Tracing everything at once can be overwhelming and expensive. The @Level annotation lets you define multiple probe handlers that are enabled at different instrumentation levels. At runtime you switch the active level through @OnEvent handlers, dialing the detail up or down without stopping and restarting the probe.

Defining level-aware handlers
import io.btrace.core.BTraceUtils;
import io.btrace.core.annotations.*;

@BTrace
public class HelloWorldTrace {
    /**
     * Level 0 (default): only trace entry into HelloWorld's own methods.
     * Enabled when the instrumentation level equals exactly 0.
     */
    @OnMethod(clazz="extra.HelloWorld", method="/.*/",
              enableAt=@Level("=0"))
    public static void onMethodL0(@ProbeMethodName(fqn = true) String pmn) {
        println("[L0] " + pmn);
    }

    /**
     * Level 1+: trace the whole class hierarchy including subclasses.
     * Enabled when the instrumentation level is 1 or higher.
     */
    @OnMethod(clazz="+extra.HelloWorld", method="/.*/",
              enableAt=@Level(">=1"))
    public static void onMethodL1(@ProbeMethodName(fqn = true) String pmn) {
        println("[L1] " + pmn);
    }

    /** Raise to level 1 by sending a named event. */
    @OnEvent("verbose")
    public static void setVerbose() {
        BTraceUtils.setInstrumentationLevel(1);
        println("Switched to verbose tracing (level 1)");
    }

    /** Drop back to level 0. */
    @OnEvent("quiet")
    public static void setQuiet() {
        BTraceUtils.setInstrumentationLevel(0);
        println("Switched back to quiet tracing (level 0)");
    }
}
Level expression syntax
Expression Meaning
@Level (no value) Always enabled (default)
@Level("N") Level >= N (shorthand for >=N)
@Level("=N") Exactly level N
@Level(">N") Strictly greater than N
@Level(">=N") Greater than or equal to N
@Level("<N") Strictly less than N
@Level("<=N") Less than or equal to N

Where N is a non-negative integer.

Switching levels from the command line

While attached interactively (press Ctrl-C), choose option 2 or 3 to send an event. For the example above:

Please enter your option: 3
Please enter event name: verbose

The setVerbose() handler fires immediately and BTrace retransforms the target classes to activate the higher-detail probes.


Lesson 10 - Oneliners

For quick, ad-hoc investigations you do not need a .java file at all. BTrace supports oneliners via the -n flag: a compact, DTrace-inspired syntax that compiles to a full BTrace script internally.

btrace -n 'CLASS::METHOD @LOCATION [filter] { ACTION }' <PID>
Components
Part Description
CLASS Class name, wildcard (*, **), or /regex/
METHOD Method name, wildcards, or /regex/; <init> for constructors
@LOCATION @entry, @return, or @error
filter Optional: if duration>100ms or if args[0]=="VALUE"
ACTION One or more of print, count, time, stack(N), separated by commas
Available identifiers in actions
Identifier Available at Description
method all Method name
class all Class name
args all Method arguments array
self all Receiver (this)
duration @return, @error Execution time in nanoseconds
return @return Return value
Examples

Print every call into callA:

btrace -n 'extra.HelloWorld::callA @entry { print method, args }' <PID>

Find calls slower than 5 ms:

btrace -n 'extra.HelloWorld::* @return if duration>5ms { print method, duration }' <PID>

Count all file opens:

btrace -n 'java.io.FileInputStream::<init> @entry { count }' <PID>

Print the stack when an IOException is constructed:

btrace -n 'java.io.IOException::<init> @entry { print args, stack(8) }' <PID>

Match methods by regex and time them:

btrace -n '/com\.example\..*/::/handle.*/ @return { time }' <PID>
Limitations
  • Single probe point per oneliner (no |-separated multi-probe)
  • No @call location, no field access probes
  • No aggregations (histograms, averages) — use a full script for those
  • Filters support simple comparisons only; no &&/||

For complex scenarios — multiple probe points, state across probes, aggregations — convert to a full BTrace script. See the Oneliner Guide for the complete syntax reference and more examples.


Lesson 11 — Runtime Contracts (btrace-contracts)

Extension ID: btrace-contracts
Service class: io.btrace.contracts.ContractService

ContractService enforces behavioral invariants at runtime — latency budgets, call-rate limits, null checks, value-range assertions — without modifying the target code. All checks are non-throwing: a violation is counted internally and the target application keeps running. It also tracks call count and average latency per user-supplied tag, so you can compare any two code paths side by side (e.g. "cached" vs "direct", "v1" vs "v2").

API

void checkLatency(String contract, long durationNanos, long budgetNanos)
void checkCallRate(String contract, int maxPerSecond)
void assertCondition(String contract, boolean condition, String message)
void checkRange(String contract, long value, long min, long max)
void checkNotNull(String contract, Object value)
void trackCodePath(String contract, long durationNanos, String tag)
String getSummary()
boolean hasViolations()
long getTotalViolations()

Example — enforcing contracts on a service endpoint

import io.btrace.contracts.ContractService;
import io.btrace.core.annotations.*;
import static io.btrace.BTrace.*;

@BTrace
public class ContractCheck {

    @Injected private static ContractService contracts;

    @OnMethod(
        clazz  = "com.example.RecommendationService",
        method = "recommend",
        location = @Location(Kind.ENTRY)
    )
    public static void onEntry() {
        contracts.checkCallRate("recommend/rate", 10);
    }

    @OnMethod(
        clazz  = "com.example.RecommendationService",
        method = "recommend",
        location = @Location(Kind.RETURN)
    )
    public static void onReturn(@Return Object result, @Duration long dur) {
        contracts.checkLatency("recommend/latency", dur, 500_000_000L);
        contracts.checkNotNull("recommend/non-null-result", result);
        contracts.trackCodePath("recommend", dur, "impl-a");
    }

    @OnEvent
    public static void report() {
        println(contracts.getSummary());
        if (contracts.hasViolations()) {
            println(strcat("Total violations: ", str(contracts.getTotalViolations())));
        }
    }

    @OnTimer(15000)
    public static void periodicReport() { report(); }
}

Copy btrace-contracts.jar to $BTRACE_HOME/extensions/, then run:

btrace <PID> ContractCheck.java

Lesson 12 — AI/LLM Application Observability

Modern Java applications increasingly embed LLM inference, RAG pipelines, and on-device model execution. BTrace ships three optional extension JARs that add purpose-built services for observing these workloads without modifying application code.

All three extensions use the standard @Injected mechanism: declare a field in your BTrace script, annotate it, and BTrace wires up the implementation at deploy time. No reflection, no extra threads, no allocation on the hot path.

12.1 LLM Inference Tracing (btrace-llm-trace)

Extension ID: btrace-llm-trace
Service class: io.btrace.llm.LlmTraceService
Builder class: io.btrace.llm.CallRecord

LlmTraceService records LLM API calls: token counts (input, output, cache-read, cache-creation), latency, streaming time-to-first-token, errors, tool calls, and embeddings. It maintains per-model statistics — call count, total tokens, and latency min/mean/max — and estimates cost using a built-in pricing table covering Claude, GPT-4o, Gemini, and other common models. The implementation is thread-safe and allocation-free on the hot path via a ThreadLocal-pooled builder.

Simple API
void recordCall(String model, long durationNanos)
void recordCall(String model, int inputTokens, int outputTokens, long durationNanos)
CallRecord call(String model)            // returns a fluent builder
void recordEmbedding(String model, int tokenCount, long durationNanos)
void recordToolUse(String model, String toolName)
void recordError(String model, String errorType, long durationNanos)
String getSummary()
double getEstimatedCostUsd()

The fluent builder lets you attach every detail of a call in one expression:

llm.call("claude-sonnet-4-20250514")
    .provider("anthropic")
    .inputTokens(1500)
    .outputTokens(300)
    .cacheReadTokens(800)
    .streaming()
    .timeToFirstToken(200_000_000L)
    .duration(durationNanos)
    .record();
Full example — instrumenting LangChain4j

The following script instruments the ChatLanguageModel.generate() method from the LangChain4j library and extracts token usage from the AiMessage response via @Return:

import io.btrace.llm.LlmTraceService;
import io.btrace.llm.CallRecord;
import io.btrace.core.annotations.*;
import static io.btrace.BTrace.*;

@BTrace
public class LlmTrace {

    @Injected
    private static LlmTraceService llm;

    // @Duration is available at Kind.RETURN and Kind.ERROR — no manual timestamps needed.
    // @ProbeClassName identifies the concrete implementation class at the probe site.

    @OnMethod(
        clazz  = "+dev.langchain4j.model.chat.ChatLanguageModel",
        method = "generate",
        location = @Location(Kind.RETURN)
    )
    public static void onChatReturn(@ProbeClassName String cls,
                                    @Return Object response,
                                    @Duration long durationNanos) {
        // Extract token counts via BTrace field-access helpers.
        // TokenUsage is nested inside Response<AiMessage>.
        Object tokenUsage = get(
            field("dev.langchain4j.model.output.Response", "tokenUsage"),
            response);
        int inputTokens  = 0;
        int outputTokens = 0;
        if (tokenUsage != null) {
            inputTokens  = (Integer) get(
                field("dev.langchain4j.model.output.TokenUsage", "inputTokenCount"),
                tokenUsage);
            outputTokens = (Integer) get(
                field("dev.langchain4j.model.output.TokenUsage", "outputTokenCount"),
                tokenUsage);
        }

        llm.call(cls)
            .inputTokens(inputTokens)
            .outputTokens(outputTokens)
            .duration(durationNanos)
            .record();
    }

    @OnMethod(
        clazz  = "+dev.langchain4j.model.chat.ChatLanguageModel",
        method = "generate",
        location = @Location(Kind.ERROR)
    )
    public static void onChatError(@ProbeClassName String cls,
                                   Throwable t,
                                   @Duration long durationNanos) {
        llm.recordError(cls, str(classOf(t)), durationNanos);
    }

    // Print a summary on demand (btrace send event) or every 60 seconds.
    @OnEvent
    public static void printSummary() {
        println(llm.getSummary());
        println(strcat("Estimated cost: $",
                       str(llm.getEstimatedCostUsd())));
    }

    @OnTimer(60000)
    public static void periodicSummary() {
        printSummary();
    }
}

Copy btrace-llm-trace.jar to $BTRACE_HOME/extensions/ — the agent discovers and loads it automatically. Then run as normal:

btrace <PID> LlmTrace.java

12.2 RAG Pipeline Observability (btrace-rag-quality)

Extension ID: btrace-rag-quality
Service class: io.btrace.rag.RagQualityService
Builder class: io.btrace.rag.QueryRecord

RagQualityService tracks vector database query performance, result counts, similarity scores, and end-to-end pipeline latency broken down into retrieval and generation phases. It counts empty retrievals and chunk token sizes and works with any vector store: Pinecone, Milvus, Weaviate, Chroma, pgvector, Qdrant, and others.

Simple API
void recordQuery(String source, long durationNanos)
void recordQuery(String source, int resultCount, long durationNanos)
QueryRecord query(String source)         // returns a fluent builder
void recordPipeline(String pipelineName, long retrievalNanos, long generationNanos)
void recordChunk(String source, int chunkTokens)
void recordEmptyRetrieval(String source)
String getSummary()
float getAverageTopScore()

Fluent builder:

rag.query("pinecone")
    .resultCount(5)
    .topScore(0.92f)
    .lowScore(0.71f)
    .embeddingDimension(1536)
    .duration(durationNanos)
    .record();
Full example — RAG pipeline with Pinecone and LangChain4j

This script instruments the retrieval call to a Pinecone-backed EmbeddingStore and the downstream LLM call together, giving a unified view of the pipeline:

import io.btrace.rag.RagQualityService;
import io.btrace.rag.QueryRecord;
import io.btrace.llm.LlmTraceService;
import io.btrace.core.annotations.*;
import static io.btrace.BTrace.*;

@BTrace
public class RagPipeline {

    @Injected private static RagQualityService rag;
    @Injected private static LlmTraceService   llm;

    // Carries the retrieval duration from the retrieval handler into the pipeline
    // handler, which runs after retrieval completes on the same thread.
    @TLS private static long lastRetrievalDur;

    // --- Retrieval phase ---

    @OnMethod(
        clazz  = "+dev.langchain4j.store.embedding.EmbeddingStore",
        method = "findRelevant",
        location = @Location(Kind.RETURN)
    )
    public static void onRetrievalReturn(@Return Object results,
                                         @ProbeClassName String store,
                                         @Duration long dur) {
        int count = (results != null) ? (Integer) call(
            method("java.util.List", "size"), results) : 0;

        if (count == 0) {
            rag.recordEmptyRetrieval(store);
        } else {
            rag.query(store)
               .resultCount(count)
               .duration(dur)
               .record();
        }
        lastRetrievalDur = dur;
    }

    // --- Generation phase ---

    @OnMethod(
        clazz  = "+dev.langchain4j.chain.ConversationalRetrievalChain",
        method = "execute",
        location = @Location(Kind.ENTRY)
    )
    public static void onPipelineEntry() {
        lastRetrievalDur = 0;
    }

    @OnMethod(
        clazz  = "+dev.langchain4j.chain.ConversationalRetrievalChain",
        method = "execute",
        location = @Location(Kind.RETURN)
    )
    public static void onPipelineReturn(@ProbeClassName String cls,
                                        @Duration long totalNanos) {
        long generationNanos = totalNanos - lastRetrievalDur;
        rag.recordPipeline(cls, lastRetrievalDur, generationNanos);
        llm.recordCall(cls, generationNanos);
    }

    @OnEvent
    public static void printSummary() {
        println("=== RAG Summary ===");
        println(rag.getSummary());
        println(strcat("Average top similarity score: ",
                       str(rag.getAverageTopScore())));
        println("=== LLM Summary ===");
        println(llm.getSummary());
    }

    @OnTimer(30000)
    public static void periodicSummary() {
        printSummary();
    }
}

Copy both extension JARs to $BTRACE_HOME/extensions/, then run:

btrace <PID> RagPipeline.java

12.3 GPU and Inference Observability (btrace-gpu-bridge)

Extension ID: btrace-gpu-bridge
Service class: io.btrace.gpu.GpuBridgeService
Builder class: io.btrace.gpu.InferenceRecord

GpuBridgeService traces the boundary between JVM code and GPU-accelerated inference runtimes: ONNX Runtime, Deep Java Library (DJL), TensorFlow Java, and Panama FFM calls into CUDA/ROCm native libraries. It tracks batch sizes, tensor dimensions, device type and ID, GPU memory allocation and deallocation, and model load times.

Simple API
void recordInference(String runtime, String modelName, long durationNanos)
void recordInference(String runtime, String modelName, int batchSize, long durationNanos)
InferenceRecord inference(String runtime, String modelName)   // fluent builder
void recordMemoryAlloc(String deviceType, int deviceId, long bytes)
void recordMemoryFree(String deviceType, int deviceId, long bytes)
void recordNativeCall(String library, String function, long durationNanos)
void recordModelLoad(String runtime, String modelName, long durationNanos)
String getSummary()
long getCurrentGpuMemoryBytes()
long getPeakGpuMemoryBytes()
Example — instrumenting ONNX Runtime
import io.btrace.gpu.GpuBridgeService;
import io.btrace.gpu.InferenceRecord;
import io.btrace.core.annotations.*;
import static io.btrace.BTrace.*;

@BTrace
public class OnnxTrace {

    @Injected private static GpuBridgeService gpu;

    // @Self is available at Kind.RETURN, so no TLS is needed to carry the session reference.
    // @Duration provides the call duration without manual timestamp diffing.

    @OnMethod(
        clazz  = "ai.onnxruntime.OrtSession",
        method = "run",
        location = @Location(Kind.RETURN)
    )
    public static void onRunReturn(@Self Object session, @Duration long dur) {
        String modelPath = str(get(
            field("ai.onnxruntime.OrtSession", "modelPath"), session));
        gpu.inference("onnxruntime", modelPath)
           .duration(dur)
           .record();
    }

    @OnMethod(
        clazz  = "ai.onnxruntime.OrtSession",
        method = "run",
        location = @Location(Kind.ERROR)
    )
    public static void onRunError(Throwable t, @Duration long dur) {
        gpu.recordNativeCall("onnxruntime", "run/error", dur);
    }

    // @Duration at Kind.RETURN gives the load duration directly; no ENTRY handler needed.
    @OnMethod(
        clazz  = "ai.onnxruntime.OrtEnvironment",
        method = "createSession",
        location = @Location(Kind.RETURN)
    )
    public static void onLoadReturn(@Return Object sess, @Duration long dur) {
        String mp = str(get(
            field("ai.onnxruntime.OrtSession", "modelPath"), sess));
        gpu.recordModelLoad("onnxruntime", mp, dur);
    }

    @OnEvent
    public static void printSummary() {
        println(gpu.getSummary());
        println(strcat("Current GPU memory: ",
                       str(gpu.getCurrentGpuMemoryBytes() / (1024 * 1024)) + " MB"));
        println(strcat("Peak GPU memory:    ",
                       str(gpu.getPeakGpuMemoryBytes()    / (1024 * 1024)) + " MB"));
    }

    @OnTimer(60000)
    public static void periodicSummary() {
        printSummary();
    }
}

Copy btrace-gpu-bridge.jar to $BTRACE_HOME/extensions/, then run:

btrace <PID> OnnxTrace.java

Using multiple AI extensions together

All four extensions can be combined in a single script. Copy all the extension JARs you need to $BTRACE_HOME/extensions/ — the agent discovers and loads them automatically:

cp btrace-llm-trace.jar btrace-rag-quality.jar \
   btrace-gpu-bridge.jar \
   $BTRACE_HOME/extensions/
btrace <PID> MyAIObservability.java

When using the fat agent at JVM startup, the same rule applies — extensions under $BTRACE_HOME/extensions/ are loaded automatically:

java -javaagent:btrace.jar=script=MyAIObservability.java -jar myapp.jar

Use @OnEvent for on-demand reporting triggered by btrace send event <PID> and @OnTimer for periodic snapshots. Both can coexist in the same script.


Lesson 13 — BTrace MCP Server: AI Agents as Diagnosticians

What is MCP?

MCP (Model Context Protocol) is a protocol that lets AI assistants call external tools during a conversation. Instead of the AI only producing text, it can invoke structured operations — search, read a file, query a database — and incorporate the results into its response. The BTrace MCP server exposes BTrace operations as MCP tools, so an LLM client such as Claude Desktop or Claude Code can attach to running JVMs, deploy probes, read output, and clean up — all through natural language conversation.

How the BTrace MCP server works

The BTrace MCP server runs as a local subprocess on the same machine as the target JVM. The AI client starts and manages the server process; you do not need to keep a terminal open for it. When the AI calls a BTrace tool, the server forwards the request to the BTrace agent (or attaches one if none is present) and returns the result. Because the server only connects to local JVMs, and because BTrace's safety model — no loops, no allocation, no exceptions, no field assignment — still applies to every probe, the AI cannot break or hang the target application.

Starting the server manually

For testing or scripting, you can start the MCP server directly:

java -jar btrace-mcp-server.jar

By default the server listens on stdio (the MCP transport used by most clients) and connects to the BTrace agent on port 2020. Override the port with:

java -Dbtrace.port=2021 -jar btrace-mcp-server.jar

Or via the environment variable BTRACE_PORT=2021.

If no BTrace agent is attached to the target JVM, the deploy_oneliner and deploy_script tools auto-attach one using the JVM's attach API — no extra step required.

Client setup

See MCPServer.md for per-client configuration instructions covering Claude Desktop, Claude Code, Cursor, OpenAI Codex CLI, VS Code, Windsurf, Continue.dev, and Zed. Every client follows the same pattern — point it at btrace-mcp-server.jar as the server command — but the configuration file location and schema differ slightly between hosts.

Available tools

Tool Description
list_jvms List all local JVMs with their PIDs, main class, and BTrace attachment status
deploy_oneliner Deploy a single-expression probe using oneliner syntax (auto-attaches agent)
deploy_script Deploy a full BTrace script file to a target JVM
list_probes List active probes on a target JVM
send_event Send a named event to trigger @OnEvent handlers
detach_probe Detach a single probe without stopping BTrace
exit_probe Stop all BTrace probes and detach the agent from the JVM

Available prompts

The server ships three pre-built diagnostic workflows that the AI can invoke as structured prompts:

Prompt What it does
diagnose_slow_endpoint Attaches timing probes to HTTP handler methods and reports latency distribution
find_exception_source Traces exception construction and propagation across the call stack
profile_method Measures throughput and latency of a named method over a configurable window

Example conversation

The following exchange shows how an AI assistant uses BTrace MCP tools to diagnose a slow HTTP endpoint. Lines marked [tool] represent MCP tool calls made by the AI during the conversation.


User: My /recommendations endpoint is taking over two seconds. Can you help me find out why?

AI: I will start by finding which JVMs are running on this machine.

[tool: list_jvms]

PID    MAIN CLASS                          BTRACE
12345  com.example.app.ApplicationMain     not attached

I found your application at PID 12345. Let me deploy a timing probe to the HTTP handler layer.

[tool: deploy_oneliner]

pid: 12345
probe: /com\.example\..*/::/.*recommendations.*/i @return { time }
Probe deployed. Waiting for output…
[12345] com.example.recommendation.RecommendationController.handle: 2341 ms
[12345] com.example.recommendation.RecommendationController.handle: 2289 ms

The controller itself is taking over two seconds. Let me look inside it — I will check whether the delay is in the database call or the LLM call.

[tool: deploy_oneliner]

pid: 12345
probe: /com\.example\.recommendation\..*/::/.+/ @return { time }
[12345] RecommendationRepository.findCandidates:  1987 ms
[12345] LlmRankingService.rank:                    312 ms

The database retrieval accounts for most of the latency. The LLM ranking step is fast. Let me clean up the probes.

[tool: exit_probe]

pid: 12345
BTrace agent detached from PID 12345.

AI: The bottleneck is RecommendationRepository.findCandidates, which takes roughly 2 seconds out of the total 2.3 seconds. The LLM ranking step adds only about 300 ms. I recommend investigating the query or index used by findCandidates — adding an index on the candidate-selection columns would likely resolve most of the latency.


Security note

The BTrace MCP server only attaches to JVMs on the local machine; it cannot connect to remote processes. Every probe the AI deploys goes through BTrace's standard verifier, which enforces the same restrictions as any other BTrace script: no loops, no object allocation, no exceptions, no field writes. The AI can observe your application in detail but cannot alter its behavior or cause it to crash.