2 .\" Copyright (c) 2004, Sun Microsystems, Inc. All Rights Reserved.
3 .\" The contents of this file are subject to the terms of the Common Development and Distribution License (the "License"). You may not use this file except in compliance with the License.
4 .\" You can obtain a copy of the license at usr/src/OPENSOLARIS.LICENSE or http://www.opensolaris.org/os/licensing. See the License for the specific language governing permissions and limitations under the License.
5 .\" When distributing Covered Code, include this CDDL HEADER in each file and include the License file at usr/src/OPENSOLARIS.LICENSE. If applicable, add the following below this CDDL HEADER, with the fields enclosed by brackets "[]" replaced with your own identifying information: Portions Copyright [yyyy] [name of copyright owner]
6 .TH CPC_BUF_CREATE 3CPC "Jan 30, 2004"
8 cpc_buf_create, cpc_buf_destroy, cpc_set_sample, cpc_buf_get, cpc_buf_set,
9 cpc_buf_hrtime, cpc_buf_tick, cpc_buf_sub, cpc_buf_add, cpc_buf_copy,
10 cpc_buf_zero \- sample and manipulate CPC data
14 cc [ \fIflag\fR\&.\|.\|. ] \fIfile\fR\&.\|.\|. \fB-lcpc\fR [ \fIlibrary\fR\&.\|.\|. ]
17 \fBcpc_buf_t *\fR\fBcpc_buf_create\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_set_t *\fR\fIset\fR);
22 \fBint\fR \fBcpc_buf_destroy\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR);
27 \fBint\fR \fBcpc_set_sample\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_set_t *\fR\fIset\fR, \fBcpc_buf_t *\fR\fIbuf\fR);
32 \fBint\fR \fBcpc_buf_get\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR, \fBint\fR \fIindex\fR, \fBuint64_t *\fR\fIval\fR);
37 \fBint\fR \fBcpc_buf_set\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR, \fBint\fR \fIindex\fR, \fBuint64_t\fR \fIval\fR);
42 \fBhrtime_t\fR \fBcpc_buf_hrtime\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR);
47 \fBuint64_t\fR \fBcpc_buf_tick\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR);
52 \fBvoid\fR \fBcpc_buf_sub\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIds\fR, \fBcpc_buf_t *\fR\fIa\fR, \fBcpc_buf_t *\fR\fIb\fR);
57 \fBvoid\fR \fBcpc_buf_add\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIds\fR, \fBcpc_buf_t *\fR\fIa\fR, \fBcpc_buf_t *\fR\fIb\fR);
62 \fBvoid\fR \fBcpc_buf_copy\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIds\fR, \fBcpc_buf_t *\fR\fIsrc\fR);
67 \fBvoid\fR \fBcpc_buf_zero\fR(\fBcpc_t *\fR\fIcpc\fR, \fBcpc_buf_t *\fR\fIbuf\fR);
73 Counter data is sampled into CPC buffers, which are represented by the opaque
74 data type \fBcpc_buf_t\fR. A CPC buffer is created with \fBcpc_buf_create()\fR
75 to hold the data for a specific CPC set. Once a CPC buffer has been created, it
76 can only be used to store and manipulate the data of the CPC set for which it
80 Once a set has been successfully bound, the counter values are sampled using
81 \fBcpc_set_sample()\fR. The \fBcpc_set_sample()\fR function takes a snapshot of
82 the hardware performance counters counting on behalf of the requests in
83 \fIset\fR and stores the 64-bit virtualized software representations of the
84 counters in the supplied CPC buffer. If a set was bound with
85 \fBcpc_bind_curlwp\fR(3CPC) or \fBcpc_bind_curlwp\fR(3CPC), the set can only be
86 sampled by the LWP that bound it.
89 The kernel maintains 64-bit virtual software counters to hold the counts
90 accumulated for each request in the set, thereby allowing applications to count
91 past the limits of the underlying physical counter, which can be significantly
92 smaller than 64 bits. The kernel attempts to maintain the full 64-bit counter
93 values even in the face of physical counter overflow on architectures and
94 processors that can automatically detect overflow. If the processor is not
95 capable of overflow detection, the caller must ensure that the counters are
96 sampled often enough to avoid the physical counters wrapping. The events most
97 prone to wrap are those that count processor clock cycles. If such an event is
98 of interest, sampling should occur frequently so that the counter does not wrap
102 The \fBcpc_buf_get()\fR function retrieves the last sampled value of a
103 particular request in \fIbuf\fR. The \fIindex\fR argument specifies which
104 request value in the set to retrieve. The index for each request is returned
105 during set configuration by \fBcpc_set_add_request\fR(3CPC). The 64-bit
106 virtualized software counter value is stored in the location pointed to by the
110 The \fBcpc_buf_set()\fR function stores a 64-bit value to a specific request in
111 the supplied buffer. This operation can be useful for performing calculations
112 with CPC buffers, but it does not affect the value of the hardware counter (and
113 thus will not affect the next sample).
116 The \fBcpc_buf_hrtime()\fR function returns a high-resolution timestamp
117 indicating exactly when the set was last sampled by the kernel.
120 The\fB cpc_buf_tick()\fR function returns a 64-bit virtualized cycle counter
121 indicating how long the set has been programmed into the counter since it was
122 bound. The units of the values returned by \fBcpc_buf_tick()\fR are CPU clock
126 The \fBcpc_buf_sub()\fR function calculates the difference between each request
127 in sets \fIa\fR and \fIb\fR, storing the result in the corresponding request
128 within set \fIds\fR. More specifically, for each request index \fIn\fR, this
129 function performs \fIds\fR[\fIn\fR] = \fIa\fR[\fIn\fR] - \fIb\fR[n]. Similarly,
130 \fBcpc_buf_add()\fR adds each request in sets \fIa\fR and \fIb\fR and stores
131 the result in the corresponding request within set \fIds\fR.
134 The \fBcpc_buf_copy()\fR function copies each value from buffer \fIsrc\fR into
135 buffer \fIds\fR. Both buffers must have been created from the same
139 The \fBcpc_buf_zero()\fR function sets each request's value in the buffer to
143 The \fBcpc_buf_destroy()\fR function frees all resources associated with the
148 Upon successful completion, \fBcpc_buf_create()\fR returns a pointer to a CPC
149 buffer which can be used to hold data for the set argument. Otherwise, this
150 function returns \fINULL\fR and sets \fBerrno\fR to indicate the error.
153 Upon successful completion, \fBcpc_set_sample()\fR, \fBcpc_buf_get()\fR, and
154 \fBcpc_buf_set()\fR return 0. Otherwise, they return -1 and set \fBerrno\fR to
159 These functions will fail if:
166 For \fBcpc_set_sample()\fR, the set is not bound, the set and/or CPC buffer
167 were not created with the given \fIcpc\fR handle, or the CPC buffer was not
168 created with the supplied set.
177 When using \fBcpc_set_sample()\fR to sample a CPU-bound set, the LWP has been
178 unbound from the processor it is measuring.
187 The library could not allocate enough memory for its internal data structures.
193 See \fBattributes\fR(5) for descriptions of the following attributes:
201 ATTRIBUTE TYPE ATTRIBUTE VALUE
203 Interface Stability Evolving
211 \fBcpc_bind_curlwp\fR(3CPC), \fBcpc_set_add_request\fR(3CPC),
212 \fBlibcpc\fR(3LIB), \fBattributes\fR(5)
216 Often the overhead of performing a system call can be too disruptive to the
217 events being measured. Once a \fBcpc_bind_curlwp\fR(3CPC) call has been issued,
218 it is possible to access directly the performance hardware registers from
219 within the application. If the performance counter context is active, the
220 counters will count on behalf of the current LWP.
223 Not all processors support this type of access. On processors where direct
224 access is not possible, \fBcpc_set_sample()\fR must be used to read the
235 rd %pic, %rN ! All UltraSPARC
236 wr %rN, %pic ! (All UltraSPARC, but see text)
251 rdpmc ! Pentium II, III, and 4 only
259 If the counter context is not active or has been invalidated, the \fB%pic\fR
260 register (SPARC), and the \fBrdpmc\fR instruction (Pentium) becomes
264 Pentium II and III processors support the non-privileged rdpmc instruction that
265 requires that the counter of interest be specified in \fB%ecx\fR and return a
266 40-bit value in the \fB%edx\fR:\fB%eax\fR register pair. There is no
267 non-privileged access mechanism for Pentium I processors.