Snap pinch zoom gestures near the screen edge.
[chromium-blink-merge.git] / components / metrics / metrics_service.cc
blob8d32b23ae7e04bb41ec1fed9adf3c40dccaafa7e
1 // Copyright 2014 The Chromium Authors. All rights reserved.
2 // Use of this source code is governed by a BSD-style license that can be
3 // found in the LICENSE file.
5 //------------------------------------------------------------------------------
6 // Description of the life cycle of a instance of MetricsService.
7 //
8 // OVERVIEW
9 //
10 // A MetricsService instance is typically created at application startup. It is
11 // the central controller for the acquisition of log data, and the automatic
12 // transmission of that log data to an external server. Its major job is to
13 // manage logs, grouping them for transmission, and transmitting them. As part
14 // of its grouping, MS finalizes logs by including some just-in-time gathered
15 // memory statistics, snapshotting the current stats of numerous histograms,
16 // closing the logs, translating to protocol buffer format, and compressing the
17 // results for transmission. Transmission includes submitting a compressed log
18 // as data in a URL-post, and retransmitting (or retaining at process
19 // termination) if the attempted transmission failed. Retention across process
20 // terminations is done using the the PrefServices facilities. The retained logs
21 // (the ones that never got transmitted) are compressed and base64-encoded
22 // before being persisted.
24 // Logs fall into one of two categories: "initial logs," and "ongoing logs."
25 // There is at most one initial log sent for each complete run of Chrome (from
26 // startup, to browser shutdown). An initial log is generally transmitted some
27 // short time (1 minute?) after startup, and includes stats such as recent crash
28 // info, the number and types of plugins, etc. The external server's response
29 // to the initial log conceptually tells this MS if it should continue
30 // transmitting logs (during this session). The server response can actually be
31 // much more detailed, and always includes (at a minimum) how often additional
32 // ongoing logs should be sent.
34 // After the above initial log, a series of ongoing logs will be transmitted.
35 // The first ongoing log actually begins to accumulate information stating when
36 // the MS was first constructed. Note that even though the initial log is
37 // commonly sent a full minute after startup, the initial log does not include
38 // much in the way of user stats. The most common interlog period (delay)
39 // is 30 minutes. That time period starts when the first user action causes a
40 // logging event. This means that if there is no user action, there may be long
41 // periods without any (ongoing) log transmissions. Ongoing logs typically
42 // contain very detailed records of user activities (ex: opened tab, closed
43 // tab, fetched URL, maximized window, etc.) In addition, just before an
44 // ongoing log is closed out, a call is made to gather memory statistics. Those
45 // memory statistics are deposited into a histogram, and the log finalization
46 // code is then called. In the finalization, a call to a Histogram server
47 // acquires a list of all local histograms that have been flagged for upload
48 // to the UMA server. The finalization also acquires the most recent number
49 // of page loads, along with any counts of renderer or plugin crashes.
51 // When the browser shuts down, there will typically be a fragment of an ongoing
52 // log that has not yet been transmitted. At shutdown time, that fragment is
53 // closed (including snapshotting histograms), and persisted, for potential
54 // transmission during a future run of the product.
56 // There are two slightly abnormal shutdown conditions. There is a
57 // "disconnected scenario," and a "really fast startup and shutdown" scenario.
58 // In the "never connected" situation, the user has (during the running of the
59 // process) never established an internet connection. As a result, attempts to
60 // transmit the initial log have failed, and a lot(?) of data has accumulated in
61 // the ongoing log (which didn't yet get closed, because there was never even a
62 // contemplation of sending it). There is also a kindred "lost connection"
63 // situation, where a loss of connection prevented an ongoing log from being
64 // transmitted, and a (still open) log was stuck accumulating a lot(?) of data,
65 // while the earlier log retried its transmission. In both of these
66 // disconnected situations, two logs need to be, and are, persistently stored
67 // for future transmission.
69 // The other unusual shutdown condition, termed "really fast startup and
70 // shutdown," involves the deliberate user termination of the process before
71 // the initial log is even formed or transmitted. In that situation, no logging
72 // is done, but the historical crash statistics remain (unlogged) for inclusion
73 // in a future run's initial log. (i.e., we don't lose crash stats).
75 // With the above overview, we can now describe the state machine's various
76 // states, based on the State enum specified in the state_ member. Those states
77 // are:
79 // INITIALIZED, // Constructor was called.
80 // INIT_TASK_SCHEDULED, // Waiting for deferred init tasks to finish.
81 // INIT_TASK_DONE, // Waiting for timer to send initial log.
82 // SENDING_LOGS, // Sending logs and creating new ones when we run out.
84 // In more detail, we have:
86 // INITIALIZED, // Constructor was called.
87 // The MS has been constructed, but has taken no actions to compose the
88 // initial log.
90 // INIT_TASK_SCHEDULED, // Waiting for deferred init tasks to finish.
91 // Typically about 30 seconds after startup, a task is sent to a second thread
92 // (the file thread) to perform deferred (lower priority and slower)
93 // initialization steps such as getting the list of plugins. That task will
94 // (when complete) make an async callback (via a Task) to indicate the
95 // completion.
97 // INIT_TASK_DONE, // Waiting for timer to send initial log.
98 // The callback has arrived, and it is now possible for an initial log to be
99 // created. This callback typically arrives back less than one second after
100 // the deferred init task is dispatched.
102 // SENDING_LOGS, // Sending logs an creating new ones when we run out.
103 // Logs from previous sessions have been loaded, and initial logs have been
104 // created (an optional stability log and the first metrics log). We will
105 // send all of these logs, and when run out, we will start cutting new logs
106 // to send. We will also cut a new log if we expect a shutdown.
108 // The progression through the above states is simple, and sequential.
109 // States proceed from INITIAL to SENDING_LOGS, and remain in the latter until
110 // shutdown.
112 // Also note that whenever we successfully send a log, we mirror the list
113 // of logs into the PrefService. This ensures that IF we crash, we won't start
114 // up and retransmit our old logs again.
116 // Due to race conditions, it is always possible that a log file could be sent
117 // twice. For example, if a log file is sent, but not yet acknowledged by
118 // the external server, and the user shuts down, then a copy of the log may be
119 // saved for re-transmission. These duplicates could be filtered out server
120 // side, but are not expected to be a significant problem.
123 //------------------------------------------------------------------------------
125 #include "components/metrics/metrics_service.h"
127 #include <algorithm>
129 #include "base/bind.h"
130 #include "base/callback.h"
131 #include "base/location.h"
132 #include "base/metrics/histogram_base.h"
133 #include "base/metrics/histogram_macros.h"
134 #include "base/metrics/histogram_samples.h"
135 #include "base/metrics/sparse_histogram.h"
136 #include "base/metrics/statistics_recorder.h"
137 #include "base/prefs/pref_registry_simple.h"
138 #include "base/prefs/pref_service.h"
139 #include "base/rand_util.h"
140 #include "base/single_thread_task_runner.h"
141 #include "base/strings/string_number_conversions.h"
142 #include "base/strings/utf_string_conversions.h"
143 #include "base/thread_task_runner_handle.h"
144 #include "base/threading/platform_thread.h"
145 #include "base/threading/thread.h"
146 #include "base/threading/thread_restrictions.h"
147 #include "base/time/time.h"
148 #include "base/tracked_objects.h"
149 #include "base/values.h"
150 #include "components/metrics/metrics_log.h"
151 #include "components/metrics/metrics_log_manager.h"
152 #include "components/metrics/metrics_log_uploader.h"
153 #include "components/metrics/metrics_pref_names.h"
154 #include "components/metrics/metrics_reporting_scheduler.h"
155 #include "components/metrics/metrics_service_client.h"
156 #include "components/metrics/metrics_state_manager.h"
157 #include "components/variations/entropy_provider.h"
158 #include "components/variations/variations_associated_data.h"
160 namespace metrics {
162 namespace {
164 // Check to see that we're being called on only one thread.
165 bool IsSingleThreaded() {
166 static base::PlatformThreadId thread_id = 0;
167 if (!thread_id)
168 thread_id = base::PlatformThread::CurrentId();
169 return base::PlatformThread::CurrentId() == thread_id;
172 // The delay, in seconds, after starting recording before doing expensive
173 // initialization work.
174 #if defined(OS_ANDROID) || defined(OS_IOS)
175 // On mobile devices, a significant portion of sessions last less than a minute.
176 // Use a shorter timer on these platforms to avoid losing data.
177 // TODO(dfalcantara): To avoid delaying startup, tighten up initialization so
178 // that it occurs after the user gets their initial page.
179 const int kInitializationDelaySeconds = 5;
180 #else
181 const int kInitializationDelaySeconds = 30;
182 #endif
184 // The maximum number of events in a log uploaded to the UMA server.
185 const int kEventLimit = 2400;
187 // If an upload fails, and the transmission was over this byte count, then we
188 // will discard the log, and not try to retransmit it. We also don't persist
189 // the log to the prefs for transmission during the next chrome session if this
190 // limit is exceeded.
191 const size_t kUploadLogAvoidRetransmitSize = 100 * 1024;
193 // Interval, in minutes, between state saves.
194 const int kSaveStateIntervalMinutes = 5;
196 enum ResponseStatus {
197 UNKNOWN_FAILURE,
198 SUCCESS,
199 BAD_REQUEST, // Invalid syntax or log too large.
200 NO_RESPONSE,
201 NUM_RESPONSE_STATUSES
204 ResponseStatus ResponseCodeToStatus(int response_code) {
205 switch (response_code) {
206 case -1:
207 return NO_RESPONSE;
208 case 200:
209 return SUCCESS;
210 case 400:
211 return BAD_REQUEST;
212 default:
213 return UNKNOWN_FAILURE;
217 #if defined(OS_ANDROID) || defined(OS_IOS)
218 void MarkAppCleanShutdownAndCommit(CleanExitBeacon* clean_exit_beacon,
219 PrefService* local_state) {
220 clean_exit_beacon->WriteBeaconValue(true);
221 local_state->SetInteger(prefs::kStabilityExecutionPhase,
222 MetricsService::SHUTDOWN_COMPLETE);
223 // Start writing right away (write happens on a different thread).
224 local_state->CommitPendingWrite();
226 #endif // defined(OS_ANDROID) || defined(OS_IOS)
228 // Determines if current log should be sent based on sampling rate. Returns true
229 // if the sampling rate is not set.
230 bool ShouldUploadLog() {
231 std::string probability_str = variations::GetVariationParamValue(
232 "UMA_EnableCellularLogUpload", "Sample_Probability");
233 if (probability_str.empty())
234 return true;
236 int probability;
237 // In case specified sampling rate is invalid.
238 if (!base::StringToInt(probability_str, &probability))
239 return true;
240 return base::RandInt(1, 100) <= probability;
243 } // namespace
246 SyntheticTrialGroup::SyntheticTrialGroup(uint32 trial, uint32 group) {
247 id.name = trial;
248 id.group = group;
251 SyntheticTrialGroup::~SyntheticTrialGroup() {
254 // static
255 MetricsService::ShutdownCleanliness MetricsService::clean_shutdown_status_ =
256 MetricsService::CLEANLY_SHUTDOWN;
258 MetricsService::ExecutionPhase MetricsService::execution_phase_ =
259 MetricsService::UNINITIALIZED_PHASE;
261 // static
262 void MetricsService::RegisterPrefs(PrefRegistrySimple* registry) {
263 DCHECK(IsSingleThreaded());
264 MetricsStateManager::RegisterPrefs(registry);
265 MetricsLog::RegisterPrefs(registry);
267 registry->RegisterInt64Pref(prefs::kInstallDate, 0);
269 registry->RegisterInt64Pref(prefs::kStabilityLaunchTimeSec, 0);
270 registry->RegisterInt64Pref(prefs::kStabilityLastTimestampSec, 0);
271 registry->RegisterStringPref(prefs::kStabilityStatsVersion, std::string());
272 registry->RegisterInt64Pref(prefs::kStabilityStatsBuildTime, 0);
273 registry->RegisterBooleanPref(prefs::kStabilityExitedCleanly, true);
274 registry->RegisterIntegerPref(prefs::kStabilityExecutionPhase,
275 UNINITIALIZED_PHASE);
276 registry->RegisterBooleanPref(prefs::kStabilitySessionEndCompleted, true);
277 registry->RegisterIntegerPref(prefs::kMetricsSessionID, -1);
279 registry->RegisterListPref(prefs::kMetricsInitialLogs);
280 registry->RegisterListPref(prefs::kMetricsOngoingLogs);
282 registry->RegisterInt64Pref(prefs::kUninstallLaunchCount, 0);
283 registry->RegisterInt64Pref(prefs::kUninstallMetricsUptimeSec, 0);
286 MetricsService::MetricsService(MetricsStateManager* state_manager,
287 MetricsServiceClient* client,
288 PrefService* local_state)
289 : log_manager_(local_state, kUploadLogAvoidRetransmitSize),
290 histogram_snapshot_manager_(this),
291 state_manager_(state_manager),
292 client_(client),
293 local_state_(local_state),
294 clean_exit_beacon_(client->GetRegistryBackupKey(), local_state),
295 recording_active_(false),
296 reporting_active_(false),
297 test_mode_active_(false),
298 state_(INITIALIZED),
299 log_upload_in_progress_(false),
300 idle_since_last_transmission_(false),
301 session_id_(-1),
302 self_ptr_factory_(this),
303 state_saver_factory_(this) {
304 DCHECK(IsSingleThreaded());
305 DCHECK(state_manager_);
306 DCHECK(client_);
307 DCHECK(local_state_);
309 // Set the install date if this is our first run.
310 int64 install_date = local_state_->GetInt64(prefs::kInstallDate);
311 if (install_date == 0)
312 local_state_->SetInt64(prefs::kInstallDate, base::Time::Now().ToTimeT());
315 MetricsService::~MetricsService() {
316 DisableRecording();
319 void MetricsService::InitializeMetricsRecordingState() {
320 InitializeMetricsState();
322 base::Closure upload_callback =
323 base::Bind(&MetricsService::StartScheduledUpload,
324 self_ptr_factory_.GetWeakPtr());
325 scheduler_.reset(
326 new MetricsReportingScheduler(
327 upload_callback,
328 // MetricsServiceClient outlives MetricsService, and
329 // MetricsReportingScheduler is tied to the lifetime of |this|.
330 base::Bind(&MetricsServiceClient::GetStandardUploadInterval,
331 base::Unretained(client_))));
334 void MetricsService::Start() {
335 HandleIdleSinceLastTransmission(false);
336 EnableRecording();
337 EnableReporting();
340 bool MetricsService::StartIfMetricsReportingEnabled() {
341 const bool enabled = state_manager_->IsMetricsReportingEnabled();
342 if (enabled)
343 Start();
344 return enabled;
347 void MetricsService::StartRecordingForTests() {
348 test_mode_active_ = true;
349 EnableRecording();
350 DisableReporting();
353 void MetricsService::Stop() {
354 HandleIdleSinceLastTransmission(false);
355 DisableReporting();
356 DisableRecording();
359 void MetricsService::EnableReporting() {
360 if (reporting_active_)
361 return;
362 reporting_active_ = true;
363 StartSchedulerIfNecessary();
366 void MetricsService::DisableReporting() {
367 reporting_active_ = false;
370 std::string MetricsService::GetClientId() {
371 return state_manager_->client_id();
374 int64 MetricsService::GetInstallDate() {
375 return local_state_->GetInt64(prefs::kInstallDate);
378 int64 MetricsService::GetMetricsReportingEnabledDate() {
379 return local_state_->GetInt64(prefs::kMetricsReportingEnabledTimestamp);
382 scoped_ptr<const base::FieldTrial::EntropyProvider>
383 MetricsService::CreateEntropyProvider() {
384 // TODO(asvitkine): Refactor the code so that MetricsService does not expose
385 // this method.
386 return state_manager_->CreateEntropyProvider();
389 void MetricsService::EnableRecording() {
390 DCHECK(IsSingleThreaded());
392 if (recording_active_)
393 return;
394 recording_active_ = true;
396 state_manager_->ForceClientIdCreation();
397 client_->SetMetricsClientId(state_manager_->client_id());
398 if (!log_manager_.current_log())
399 OpenNewLog();
401 for (size_t i = 0; i < metrics_providers_.size(); ++i)
402 metrics_providers_[i]->OnRecordingEnabled();
404 base::RemoveActionCallback(action_callback_);
405 action_callback_ = base::Bind(&MetricsService::OnUserAction,
406 base::Unretained(this));
407 base::AddActionCallback(action_callback_);
410 void MetricsService::DisableRecording() {
411 DCHECK(IsSingleThreaded());
413 if (!recording_active_)
414 return;
415 recording_active_ = false;
417 client_->OnRecordingDisabled();
419 base::RemoveActionCallback(action_callback_);
421 for (size_t i = 0; i < metrics_providers_.size(); ++i)
422 metrics_providers_[i]->OnRecordingDisabled();
424 PushPendingLogsToPersistentStorage();
427 bool MetricsService::recording_active() const {
428 DCHECK(IsSingleThreaded());
429 return recording_active_;
432 bool MetricsService::reporting_active() const {
433 DCHECK(IsSingleThreaded());
434 return reporting_active_;
437 void MetricsService::RecordDelta(const base::HistogramBase& histogram,
438 const base::HistogramSamples& snapshot) {
439 log_manager_.current_log()->RecordHistogramDelta(histogram.histogram_name(),
440 snapshot);
443 void MetricsService::InconsistencyDetected(
444 base::HistogramBase::Inconsistency problem) {
445 UMA_HISTOGRAM_ENUMERATION("Histogram.InconsistenciesBrowser",
446 problem, base::HistogramBase::NEVER_EXCEEDED_VALUE);
449 void MetricsService::UniqueInconsistencyDetected(
450 base::HistogramBase::Inconsistency problem) {
451 UMA_HISTOGRAM_ENUMERATION("Histogram.InconsistenciesBrowserUnique",
452 problem, base::HistogramBase::NEVER_EXCEEDED_VALUE);
455 void MetricsService::InconsistencyDetectedInLoggedCount(int amount) {
456 UMA_HISTOGRAM_COUNTS("Histogram.InconsistentSnapshotBrowser",
457 std::abs(amount));
460 void MetricsService::HandleIdleSinceLastTransmission(bool in_idle) {
461 // If there wasn't a lot of action, maybe the computer was asleep, in which
462 // case, the log transmissions should have stopped. Here we start them up
463 // again.
464 if (!in_idle && idle_since_last_transmission_)
465 StartSchedulerIfNecessary();
466 idle_since_last_transmission_ = in_idle;
469 void MetricsService::OnApplicationNotIdle() {
470 if (recording_active_)
471 HandleIdleSinceLastTransmission(false);
474 void MetricsService::RecordStartOfSessionEnd() {
475 LogCleanShutdown();
476 RecordBooleanPrefValue(prefs::kStabilitySessionEndCompleted, false);
479 void MetricsService::RecordCompletedSessionEnd() {
480 LogCleanShutdown();
481 RecordBooleanPrefValue(prefs::kStabilitySessionEndCompleted, true);
484 #if defined(OS_ANDROID) || defined(OS_IOS)
485 void MetricsService::OnAppEnterBackground() {
486 scheduler_->Stop();
488 MarkAppCleanShutdownAndCommit(&clean_exit_beacon_, local_state_);
490 // At this point, there's no way of knowing when the process will be
491 // killed, so this has to be treated similar to a shutdown, closing and
492 // persisting all logs. Unlinke a shutdown, the state is primed to be ready
493 // to continue logging and uploading if the process does return.
494 if (recording_active() && state_ >= SENDING_LOGS) {
495 PushPendingLogsToPersistentStorage();
496 // Persisting logs closes the current log, so start recording a new log
497 // immediately to capture any background work that might be done before the
498 // process is killed.
499 OpenNewLog();
503 void MetricsService::OnAppEnterForeground() {
504 clean_exit_beacon_.WriteBeaconValue(false);
505 StartSchedulerIfNecessary();
507 #else
508 void MetricsService::LogNeedForCleanShutdown() {
509 clean_exit_beacon_.WriteBeaconValue(false);
510 // Redundant setting to be sure we call for a clean shutdown.
511 clean_shutdown_status_ = NEED_TO_SHUTDOWN;
513 #endif // defined(OS_ANDROID) || defined(OS_IOS)
515 // static
516 void MetricsService::SetExecutionPhase(ExecutionPhase execution_phase,
517 PrefService* local_state) {
518 execution_phase_ = execution_phase;
519 local_state->SetInteger(prefs::kStabilityExecutionPhase, execution_phase_);
522 void MetricsService::RecordBreakpadRegistration(bool success) {
523 if (!success)
524 IncrementPrefValue(prefs::kStabilityBreakpadRegistrationFail);
525 else
526 IncrementPrefValue(prefs::kStabilityBreakpadRegistrationSuccess);
529 void MetricsService::RecordBreakpadHasDebugger(bool has_debugger) {
530 if (!has_debugger)
531 IncrementPrefValue(prefs::kStabilityDebuggerNotPresent);
532 else
533 IncrementPrefValue(prefs::kStabilityDebuggerPresent);
536 void MetricsService::ClearSavedStabilityMetrics() {
537 for (size_t i = 0; i < metrics_providers_.size(); ++i)
538 metrics_providers_[i]->ClearSavedStabilityMetrics();
540 // Reset the prefs that are managed by MetricsService/MetricsLog directly.
541 local_state_->SetInteger(prefs::kStabilityCrashCount, 0);
542 local_state_->SetInteger(prefs::kStabilityExecutionPhase,
543 UNINITIALIZED_PHASE);
544 local_state_->SetInteger(prefs::kStabilityIncompleteSessionEndCount, 0);
545 local_state_->SetInteger(prefs::kStabilityLaunchCount, 0);
546 local_state_->SetBoolean(prefs::kStabilitySessionEndCompleted, true);
549 void MetricsService::PushExternalLog(const std::string& log) {
550 log_manager_.StoreLog(log, MetricsLog::ONGOING_LOG);
553 //------------------------------------------------------------------------------
554 // private methods
555 //------------------------------------------------------------------------------
558 //------------------------------------------------------------------------------
559 // Initialization methods
561 void MetricsService::InitializeMetricsState() {
562 const int64 buildtime = MetricsLog::GetBuildTime();
563 const std::string version = client_->GetVersionString();
564 bool version_changed = false;
565 if (local_state_->GetInt64(prefs::kStabilityStatsBuildTime) != buildtime ||
566 local_state_->GetString(prefs::kStabilityStatsVersion) != version) {
567 local_state_->SetString(prefs::kStabilityStatsVersion, version);
568 local_state_->SetInt64(prefs::kStabilityStatsBuildTime, buildtime);
569 version_changed = true;
572 log_manager_.LoadPersistedUnsentLogs();
574 session_id_ = local_state_->GetInteger(prefs::kMetricsSessionID);
576 if (!clean_exit_beacon_.exited_cleanly()) {
577 IncrementPrefValue(prefs::kStabilityCrashCount);
578 // Reset flag, and wait until we call LogNeedForCleanShutdown() before
579 // monitoring.
580 clean_exit_beacon_.WriteBeaconValue(true);
583 bool has_initial_stability_log = false;
584 if (!clean_exit_beacon_.exited_cleanly() ||
585 ProvidersHaveInitialStabilityMetrics()) {
586 // TODO(rtenneti): On windows, consider saving/getting execution_phase from
587 // the registry.
588 int execution_phase =
589 local_state_->GetInteger(prefs::kStabilityExecutionPhase);
590 UMA_HISTOGRAM_SPARSE_SLOWLY("Chrome.Browser.CrashedExecutionPhase",
591 execution_phase);
593 // If the previous session didn't exit cleanly, or if any provider
594 // explicitly requests it, prepare an initial stability log -
595 // provided UMA is enabled.
596 if (state_manager_->IsMetricsReportingEnabled())
597 has_initial_stability_log = PrepareInitialStabilityLog();
600 // If no initial stability log was generated and there was a version upgrade,
601 // clear the stability stats from the previous version (so that they don't get
602 // attributed to the current version). This could otherwise happen due to a
603 // number of different edge cases, such as if the last version crashed before
604 // it could save off a system profile or if UMA reporting is disabled (which
605 // normally results in stats being accumulated).
606 if (!has_initial_stability_log && version_changed)
607 ClearSavedStabilityMetrics();
609 // Update session ID.
610 ++session_id_;
611 local_state_->SetInteger(prefs::kMetricsSessionID, session_id_);
613 // Stability bookkeeping
614 IncrementPrefValue(prefs::kStabilityLaunchCount);
616 DCHECK_EQ(UNINITIALIZED_PHASE, execution_phase_);
617 SetExecutionPhase(START_METRICS_RECORDING, local_state_);
619 if (!local_state_->GetBoolean(prefs::kStabilitySessionEndCompleted)) {
620 IncrementPrefValue(prefs::kStabilityIncompleteSessionEndCount);
621 // This is marked false when we get a WM_ENDSESSION.
622 local_state_->SetBoolean(prefs::kStabilitySessionEndCompleted, true);
625 // Call GetUptimes() for the first time, thus allowing all later calls
626 // to record incremental uptimes accurately.
627 base::TimeDelta ignored_uptime_parameter;
628 base::TimeDelta startup_uptime;
629 GetUptimes(local_state_, &startup_uptime, &ignored_uptime_parameter);
630 DCHECK_EQ(0, startup_uptime.InMicroseconds());
631 // For backwards compatibility, leave this intact in case Omaha is checking
632 // them. prefs::kStabilityLastTimestampSec may also be useless now.
633 // TODO(jar): Delete these if they have no uses.
634 local_state_->SetInt64(prefs::kStabilityLaunchTimeSec,
635 base::Time::Now().ToTimeT());
637 // Bookkeeping for the uninstall metrics.
638 IncrementLongPrefsValue(prefs::kUninstallLaunchCount);
640 // Kick off the process of saving the state (so the uptime numbers keep
641 // getting updated) every n minutes.
642 ScheduleNextStateSave();
645 void MetricsService::OnUserAction(const std::string& action) {
646 if (!ShouldLogEvents())
647 return;
649 log_manager_.current_log()->RecordUserAction(action);
650 HandleIdleSinceLastTransmission(false);
653 void MetricsService::FinishedGatheringInitialMetrics() {
654 DCHECK_EQ(INIT_TASK_SCHEDULED, state_);
655 state_ = INIT_TASK_DONE;
657 // Create the initial log.
658 if (!initial_metrics_log_.get()) {
659 initial_metrics_log_ = CreateLog(MetricsLog::ONGOING_LOG);
660 NotifyOnDidCreateMetricsLog();
663 scheduler_->InitTaskComplete();
666 void MetricsService::GetUptimes(PrefService* pref,
667 base::TimeDelta* incremental_uptime,
668 base::TimeDelta* uptime) {
669 base::TimeTicks now = base::TimeTicks::Now();
670 // If this is the first call, init |first_updated_time_| and
671 // |last_updated_time_|.
672 if (last_updated_time_.is_null()) {
673 first_updated_time_ = now;
674 last_updated_time_ = now;
676 *incremental_uptime = now - last_updated_time_;
677 *uptime = now - first_updated_time_;
678 last_updated_time_ = now;
680 const int64 incremental_time_secs = incremental_uptime->InSeconds();
681 if (incremental_time_secs > 0) {
682 int64 metrics_uptime = pref->GetInt64(prefs::kUninstallMetricsUptimeSec);
683 metrics_uptime += incremental_time_secs;
684 pref->SetInt64(prefs::kUninstallMetricsUptimeSec, metrics_uptime);
688 void MetricsService::NotifyOnDidCreateMetricsLog() {
689 DCHECK(IsSingleThreaded());
690 for (size_t i = 0; i < metrics_providers_.size(); ++i)
691 metrics_providers_[i]->OnDidCreateMetricsLog();
694 //------------------------------------------------------------------------------
695 // State save methods
697 void MetricsService::ScheduleNextStateSave() {
698 state_saver_factory_.InvalidateWeakPtrs();
700 base::ThreadTaskRunnerHandle::Get()->PostDelayedTask(
701 FROM_HERE, base::Bind(&MetricsService::SaveLocalState,
702 state_saver_factory_.GetWeakPtr()),
703 base::TimeDelta::FromMinutes(kSaveStateIntervalMinutes));
706 void MetricsService::SaveLocalState() {
707 RecordCurrentState(local_state_);
709 // TODO(jar):110021 Does this run down the batteries????
710 ScheduleNextStateSave();
714 //------------------------------------------------------------------------------
715 // Recording control methods
717 void MetricsService::OpenNewLog() {
718 DCHECK(!log_manager_.current_log());
720 log_manager_.BeginLoggingWithLog(CreateLog(MetricsLog::ONGOING_LOG));
721 NotifyOnDidCreateMetricsLog();
722 if (state_ == INITIALIZED) {
723 // We only need to schedule that run once.
724 state_ = INIT_TASK_SCHEDULED;
726 base::ThreadTaskRunnerHandle::Get()->PostDelayedTask(
727 FROM_HERE, base::Bind(&MetricsService::StartGatheringMetrics,
728 self_ptr_factory_.GetWeakPtr()),
729 base::TimeDelta::FromSeconds(kInitializationDelaySeconds));
733 void MetricsService::StartGatheringMetrics() {
734 client_->StartGatheringMetrics(
735 base::Bind(&MetricsService::FinishedGatheringInitialMetrics,
736 self_ptr_factory_.GetWeakPtr()));
739 void MetricsService::CloseCurrentLog() {
740 if (!log_manager_.current_log())
741 return;
743 // TODO(jar): Integrate bounds on log recording more consistently, so that we
744 // can stop recording logs that are too big much sooner.
745 if (log_manager_.current_log()->num_events() > kEventLimit) {
746 UMA_HISTOGRAM_COUNTS("UMA.Discarded Log Events",
747 log_manager_.current_log()->num_events());
748 log_manager_.DiscardCurrentLog();
749 OpenNewLog(); // Start trivial log to hold our histograms.
752 // Put incremental data (histogram deltas, and realtime stats deltas) at the
753 // end of all log transmissions (initial log handles this separately).
754 // RecordIncrementalStabilityElements only exists on the derived
755 // MetricsLog class.
756 MetricsLog* current_log = log_manager_.current_log();
757 DCHECK(current_log);
758 RecordCurrentEnvironment(current_log);
759 base::TimeDelta incremental_uptime;
760 base::TimeDelta uptime;
761 GetUptimes(local_state_, &incremental_uptime, &uptime);
762 current_log->RecordStabilityMetrics(metrics_providers_.get(),
763 incremental_uptime, uptime);
765 current_log->RecordGeneralMetrics(metrics_providers_.get());
766 RecordCurrentHistograms();
768 log_manager_.FinishCurrentLog();
771 void MetricsService::PushPendingLogsToPersistentStorage() {
772 if (state_ < SENDING_LOGS)
773 return; // We didn't and still don't have time to get plugin list etc.
775 CloseCurrentLog();
776 log_manager_.PersistUnsentLogs();
779 //------------------------------------------------------------------------------
780 // Transmission of logs methods
782 void MetricsService::StartSchedulerIfNecessary() {
783 // Never schedule cutting or uploading of logs in test mode.
784 if (test_mode_active_)
785 return;
787 // Even if reporting is disabled, the scheduler is needed to trigger the
788 // creation of the initial log, which must be done in order for any logs to be
789 // persisted on shutdown or backgrounding.
790 if (recording_active() &&
791 (reporting_active() || state_ < SENDING_LOGS)) {
792 scheduler_->Start();
796 void MetricsService::StartScheduledUpload() {
797 DCHECK(state_ >= INIT_TASK_DONE);
798 // If we're getting no notifications, then the log won't have much in it, and
799 // it's possible the computer is about to go to sleep, so don't upload and
800 // stop the scheduler.
801 // If recording has been turned off, the scheduler doesn't need to run.
802 // If reporting is off, proceed if the initial log hasn't been created, since
803 // that has to happen in order for logs to be cut and stored when persisting.
804 // TODO(stuartmorgan): Call Stop() on the scheduler when reporting and/or
805 // recording are turned off instead of letting it fire and then aborting.
806 if (idle_since_last_transmission_ ||
807 !recording_active() ||
808 (!reporting_active() && state_ >= SENDING_LOGS)) {
809 scheduler_->Stop();
810 scheduler_->UploadCancelled();
811 return;
814 // If there are unsent logs, send the next one. If not, start the asynchronous
815 // process of finalizing the current log for upload.
816 if (state_ == SENDING_LOGS && log_manager_.has_unsent_logs()) {
817 SendNextLog();
818 } else {
819 // There are no logs left to send, so start creating a new one.
820 client_->CollectFinalMetrics(
821 base::Bind(&MetricsService::OnFinalLogInfoCollectionDone,
822 self_ptr_factory_.GetWeakPtr()));
826 void MetricsService::OnFinalLogInfoCollectionDone() {
827 // If somehow there is a log upload in progress, we return and hope things
828 // work out. The scheduler isn't informed since if this happens, the scheduler
829 // will get a response from the upload.
830 DCHECK(!log_upload_in_progress_);
831 if (log_upload_in_progress_)
832 return;
834 // Abort if metrics were turned off during the final info gathering.
835 if (!recording_active()) {
836 scheduler_->Stop();
837 scheduler_->UploadCancelled();
838 return;
841 if (state_ == INIT_TASK_DONE) {
842 PrepareInitialMetricsLog();
843 } else {
844 DCHECK_EQ(SENDING_LOGS, state_);
845 CloseCurrentLog();
846 OpenNewLog();
848 SendNextLog();
851 void MetricsService::SendNextLog() {
852 DCHECK_EQ(SENDING_LOGS, state_);
853 if (!reporting_active()) {
854 scheduler_->Stop();
855 scheduler_->UploadCancelled();
856 return;
858 if (!log_manager_.has_unsent_logs()) {
859 // Should only get here if serializing the log failed somehow.
860 // Just tell the scheduler it was uploaded and wait for the next log
861 // interval.
862 scheduler_->UploadFinished(true, log_manager_.has_unsent_logs());
863 return;
865 if (!log_manager_.has_staged_log())
866 log_manager_.StageNextLogForUpload();
867 SendStagedLog();
870 bool MetricsService::ProvidersHaveInitialStabilityMetrics() {
871 // Check whether any metrics provider has initial stability metrics.
872 for (size_t i = 0; i < metrics_providers_.size(); ++i) {
873 if (metrics_providers_[i]->HasInitialStabilityMetrics())
874 return true;
877 return false;
880 bool MetricsService::PrepareInitialStabilityLog() {
881 DCHECK_EQ(INITIALIZED, state_);
883 scoped_ptr<MetricsLog> initial_stability_log(
884 CreateLog(MetricsLog::INITIAL_STABILITY_LOG));
886 // Do not call NotifyOnDidCreateMetricsLog here because the stability
887 // log describes stats from the _previous_ session.
889 if (!initial_stability_log->LoadSavedEnvironmentFromPrefs())
890 return false;
892 log_manager_.PauseCurrentLog();
893 log_manager_.BeginLoggingWithLog(initial_stability_log.Pass());
895 // Note: Some stability providers may record stability stats via histograms,
896 // so this call has to be after BeginLoggingWithLog().
897 log_manager_.current_log()->RecordStabilityMetrics(
898 metrics_providers_.get(), base::TimeDelta(), base::TimeDelta());
899 RecordCurrentStabilityHistograms();
901 // Note: RecordGeneralMetrics() intentionally not called since this log is for
902 // stability stats from a previous session only.
904 log_manager_.FinishCurrentLog();
905 log_manager_.ResumePausedLog();
907 // Store unsent logs, including the stability log that was just saved, so
908 // that they're not lost in case of a crash before upload time.
909 log_manager_.PersistUnsentLogs();
911 return true;
914 void MetricsService::PrepareInitialMetricsLog() {
915 DCHECK_EQ(INIT_TASK_DONE, state_);
917 RecordCurrentEnvironment(initial_metrics_log_.get());
918 base::TimeDelta incremental_uptime;
919 base::TimeDelta uptime;
920 GetUptimes(local_state_, &incremental_uptime, &uptime);
922 // Histograms only get written to the current log, so make the new log current
923 // before writing them.
924 log_manager_.PauseCurrentLog();
925 log_manager_.BeginLoggingWithLog(initial_metrics_log_.Pass());
927 // Note: Some stability providers may record stability stats via histograms,
928 // so this call has to be after BeginLoggingWithLog().
929 MetricsLog* current_log = log_manager_.current_log();
930 current_log->RecordStabilityMetrics(metrics_providers_.get(),
931 base::TimeDelta(), base::TimeDelta());
932 current_log->RecordGeneralMetrics(metrics_providers_.get());
933 RecordCurrentHistograms();
935 log_manager_.FinishCurrentLog();
936 log_manager_.ResumePausedLog();
938 // Store unsent logs, including the initial log that was just saved, so
939 // that they're not lost in case of a crash before upload time.
940 log_manager_.PersistUnsentLogs();
942 state_ = SENDING_LOGS;
945 void MetricsService::SendStagedLog() {
946 DCHECK(log_manager_.has_staged_log());
947 if (!log_manager_.has_staged_log())
948 return;
950 DCHECK(!log_upload_in_progress_);
951 log_upload_in_progress_ = true;
953 if (!ShouldUploadLog()) {
954 SkipAndDiscardUpload();
955 return;
958 if (!log_uploader_) {
959 log_uploader_ = client_->CreateUploader(
960 base::Bind(&MetricsService::OnLogUploadComplete,
961 self_ptr_factory_.GetWeakPtr()));
964 const std::string hash =
965 base::HexEncode(log_manager_.staged_log_hash().data(),
966 log_manager_.staged_log_hash().size());
967 bool success = log_uploader_->UploadLog(log_manager_.staged_log(), hash);
968 UMA_HISTOGRAM_BOOLEAN("UMA.UploadCreation", success);
969 if (!success) {
970 // Skip this upload and hope things work out next time.
971 SkipAndDiscardUpload();
972 return;
975 HandleIdleSinceLastTransmission(true);
979 void MetricsService::OnLogUploadComplete(int response_code) {
980 DCHECK_EQ(SENDING_LOGS, state_);
981 DCHECK(log_upload_in_progress_);
982 log_upload_in_progress_ = false;
984 // Log a histogram to track response success vs. failure rates.
985 UMA_HISTOGRAM_ENUMERATION("UMA.UploadResponseStatus.Protobuf",
986 ResponseCodeToStatus(response_code),
987 NUM_RESPONSE_STATUSES);
989 bool upload_succeeded = response_code == 200;
991 // Provide boolean for error recovery (allow us to ignore response_code).
992 bool discard_log = false;
993 const size_t log_size = log_manager_.staged_log().length();
994 if (upload_succeeded) {
995 UMA_HISTOGRAM_COUNTS_10000("UMA.LogSize.OnSuccess", log_size / 1024);
996 } else if (log_size > kUploadLogAvoidRetransmitSize) {
997 UMA_HISTOGRAM_COUNTS("UMA.Large Rejected Log was Discarded",
998 static_cast<int>(log_size));
999 discard_log = true;
1000 } else if (response_code == 400) {
1001 // Bad syntax. Retransmission won't work.
1002 discard_log = true;
1005 if (upload_succeeded || discard_log) {
1006 log_manager_.DiscardStagedLog();
1007 // Store the updated list to disk now that the removed log is uploaded.
1008 log_manager_.PersistUnsentLogs();
1011 // Error 400 indicates a problem with the log, not with the server, so
1012 // don't consider that a sign that the server is in trouble.
1013 bool server_is_healthy = upload_succeeded || response_code == 400;
1014 scheduler_->UploadFinished(server_is_healthy, log_manager_.has_unsent_logs());
1016 if (server_is_healthy)
1017 client_->OnLogUploadComplete();
1020 void MetricsService::IncrementPrefValue(const char* path) {
1021 int value = local_state_->GetInteger(path);
1022 local_state_->SetInteger(path, value + 1);
1025 void MetricsService::IncrementLongPrefsValue(const char* path) {
1026 int64 value = local_state_->GetInt64(path);
1027 local_state_->SetInt64(path, value + 1);
1030 bool MetricsService::UmaMetricsProperlyShutdown() {
1031 CHECK(clean_shutdown_status_ == CLEANLY_SHUTDOWN ||
1032 clean_shutdown_status_ == NEED_TO_SHUTDOWN);
1033 return clean_shutdown_status_ == CLEANLY_SHUTDOWN;
1036 void MetricsService::AddSyntheticTrialObserver(
1037 SyntheticTrialObserver* observer) {
1038 synthetic_trial_observer_list_.AddObserver(observer);
1039 if (!synthetic_trial_groups_.empty())
1040 observer->OnSyntheticTrialsChanged(synthetic_trial_groups_);
1043 void MetricsService::RemoveSyntheticTrialObserver(
1044 SyntheticTrialObserver* observer) {
1045 synthetic_trial_observer_list_.RemoveObserver(observer);
1048 void MetricsService::RegisterSyntheticFieldTrial(
1049 const SyntheticTrialGroup& trial) {
1050 for (size_t i = 0; i < synthetic_trial_groups_.size(); ++i) {
1051 if (synthetic_trial_groups_[i].id.name == trial.id.name) {
1052 if (synthetic_trial_groups_[i].id.group != trial.id.group) {
1053 synthetic_trial_groups_[i].id.group = trial.id.group;
1054 synthetic_trial_groups_[i].start_time = base::TimeTicks::Now();
1055 NotifySyntheticTrialObservers();
1057 return;
1061 SyntheticTrialGroup trial_group = trial;
1062 trial_group.start_time = base::TimeTicks::Now();
1063 synthetic_trial_groups_.push_back(trial_group);
1064 NotifySyntheticTrialObservers();
1067 void MetricsService::RegisterMetricsProvider(
1068 scoped_ptr<MetricsProvider> provider) {
1069 DCHECK_EQ(INITIALIZED, state_);
1070 metrics_providers_.push_back(provider.Pass());
1073 void MetricsService::CheckForClonedInstall(
1074 scoped_refptr<base::SingleThreadTaskRunner> task_runner) {
1075 state_manager_->CheckForClonedInstall(task_runner);
1078 void MetricsService::NotifySyntheticTrialObservers() {
1079 FOR_EACH_OBSERVER(SyntheticTrialObserver, synthetic_trial_observer_list_,
1080 OnSyntheticTrialsChanged(synthetic_trial_groups_));
1083 void MetricsService::GetCurrentSyntheticFieldTrials(
1084 std::vector<variations::ActiveGroupId>* synthetic_trials) {
1085 DCHECK(synthetic_trials);
1086 synthetic_trials->clear();
1087 const MetricsLog* current_log = log_manager_.current_log();
1088 for (size_t i = 0; i < synthetic_trial_groups_.size(); ++i) {
1089 if (synthetic_trial_groups_[i].start_time <= current_log->creation_time())
1090 synthetic_trials->push_back(synthetic_trial_groups_[i].id);
1094 scoped_ptr<MetricsLog> MetricsService::CreateLog(MetricsLog::LogType log_type) {
1095 return make_scoped_ptr(new MetricsLog(state_manager_->client_id(),
1096 session_id_,
1097 log_type,
1098 client_,
1099 local_state_));
1102 void MetricsService::RecordCurrentEnvironment(MetricsLog* log) {
1103 std::vector<variations::ActiveGroupId> synthetic_trials;
1104 GetCurrentSyntheticFieldTrials(&synthetic_trials);
1105 log->RecordEnvironment(metrics_providers_.get(), synthetic_trials,
1106 GetInstallDate(), GetMetricsReportingEnabledDate());
1107 UMA_HISTOGRAM_COUNTS_100("UMA.SyntheticTrials.Count",
1108 synthetic_trials.size());
1111 void MetricsService::RecordCurrentHistograms() {
1112 DCHECK(log_manager_.current_log());
1113 histogram_snapshot_manager_.PrepareDeltas(
1114 base::Histogram::kNoFlags, base::Histogram::kUmaTargetedHistogramFlag);
1117 void MetricsService::RecordCurrentStabilityHistograms() {
1118 DCHECK(log_manager_.current_log());
1119 histogram_snapshot_manager_.PrepareDeltas(
1120 base::Histogram::kNoFlags, base::Histogram::kUmaStabilityHistogramFlag);
1123 void MetricsService::LogCleanShutdown() {
1124 // Redundant setting to assure that we always reset this value at shutdown
1125 // (and that we don't use some alternate path, and not call LogCleanShutdown).
1126 clean_shutdown_status_ = CLEANLY_SHUTDOWN;
1128 clean_exit_beacon_.WriteBeaconValue(true);
1129 RecordCurrentState(local_state_);
1130 local_state_->SetInteger(prefs::kStabilityExecutionPhase,
1131 MetricsService::SHUTDOWN_COMPLETE);
1134 bool MetricsService::ShouldLogEvents() {
1135 // We simply don't log events to UMA if there is a single incognito
1136 // session visible. The problem is that we always notify using the original
1137 // profile in order to simplify notification processing.
1138 return !client_->IsOffTheRecordSessionActive();
1141 void MetricsService::RecordBooleanPrefValue(const char* path, bool value) {
1142 DCHECK(IsSingleThreaded());
1143 local_state_->SetBoolean(path, value);
1144 RecordCurrentState(local_state_);
1147 void MetricsService::RecordCurrentState(PrefService* pref) {
1148 pref->SetInt64(prefs::kStabilityLastTimestampSec,
1149 base::Time::Now().ToTimeT());
1152 void MetricsService::SkipAndDiscardUpload() {
1153 log_manager_.DiscardStagedLog();
1154 scheduler_->UploadCancelled();
1155 log_upload_in_progress_ = false;
1158 } // namespace metrics