Changed trials-in-common page to use a single database query to get analyses. Changed...
[sgn.git] / docs / data-analysis-tools.html
blobc3aba82ffc7d53077d4c54cb3cd9c23255061ef0
1 <!DOCTYPE html>
2 <html lang="" xml:lang="">
3 <head>
5 <meta charset="utf-8" />
6 <meta http-equiv="X-UA-Compatible" content="IE=edge" />
7 <title>Chapter 25 Data Analysis Tools | User Manual of Breedbase</title>
8 <meta name="description" content="A manual for all things Breedbase. How to create an account, search the database, manage different data types, and analyze data." />
9 <meta name="generator" content="bookdown 0.36 and GitBook 2.6.7" />
11 <meta property="og:title" content="Chapter 25 Data Analysis Tools | User Manual of Breedbase" />
12 <meta property="og:type" content="book" />
13 <meta property="og:image" content="https://solgenomics.github.io/assets/images/Breedbase_HighRes.png" />
14 <meta property="og:description" content="A manual for all things Breedbase. How to create an account, search the database, manage different data types, and analyze data." />
15 <meta name="github-repo" content="solgenomics/sgn" />
17 <meta name="twitter:card" content="summary" />
18 <meta name="twitter:title" content="Chapter 25 Data Analysis Tools | User Manual of Breedbase" />
20 <meta name="twitter:description" content="A manual for all things Breedbase. How to create an account, search the database, manage different data types, and analyze data." />
21 <meta name="twitter:image" content="https://solgenomics.github.io/assets/images/Breedbase_HighRes.png" />
23 <meta name="author" content="Breedbase team" />
26 <meta name="date" content="2024-07-22" />
28 <meta name="viewport" content="width=device-width, initial-scale=1" />
29 <meta name="apple-mobile-web-app-capable" content="yes" />
30 <meta name="apple-mobile-web-app-status-bar-style" content="black" />
33 <link rel="prev" href="managing-outliers-in-dataset.html"/>
35 <script src="libs/jquery-3.6.0/jquery-3.6.0.min.js"></script>
36 <script src="https://cdn.jsdelivr.net/npm/fuse.js@6.4.6/dist/fuse.min.js"></script>
37 <link href="libs/gitbook-2.6.7/css/style.css" rel="stylesheet" />
38 <link href="libs/gitbook-2.6.7/css/plugin-table.css" rel="stylesheet" />
39 <link href="libs/gitbook-2.6.7/css/plugin-bookdown.css" rel="stylesheet" />
40 <link href="libs/gitbook-2.6.7/css/plugin-highlight.css" rel="stylesheet" />
41 <link href="libs/gitbook-2.6.7/css/plugin-search.css" rel="stylesheet" />
42 <link href="libs/gitbook-2.6.7/css/plugin-fontsettings.css" rel="stylesheet" />
43 <link href="libs/gitbook-2.6.7/css/plugin-clipboard.css" rel="stylesheet" />
52 <link href="libs/anchor-sections-1.1.0/anchor-sections.css" rel="stylesheet" />
53 <link href="libs/anchor-sections-1.1.0/anchor-sections-hash.css" rel="stylesheet" />
54 <script src="libs/anchor-sections-1.1.0/anchor-sections.js"></script>
55 <script async defer src="https://hypothes.is/embed.js"></script>
59 <style type="text/css">
61 div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;}
62 </style>
64 <link rel="stylesheet" href="assets/style.css" type="text/css" />
65 </head>
67 <body>
71 <div class="book without-animation with-summary font-size-2 font-family-1" data-basepath=".">
73 <div class="book-summary">
74 <nav role="navigation">
76 <ul class="summary">
77 <li class="toc-logo"><a href="./"><img src="assets/images/Breedbase_HighRes.png"></a></li>
78 <li class="divider"></li>
79 <li class="chapter" data-level="" data-path="index.html"><a href="index.html"><i class="fa fa-check"></i>Introduction</a></li>
80 <li class="chapter" data-level="1" data-path="basic-website-usage.html"><a href="basic-website-usage.html"><i class="fa fa-check"></i><b>1</b> Basic Website Usage</a>
81 <ul>
82 <li class="chapter" data-level="1.1" data-path="basic-website-usage.html"><a href="basic-website-usage.html#creating-a-user-account"><i class="fa fa-check"></i><b>1.1</b> Creating a User Account</a>
83 <ul>
84 <li class="chapter" data-level="1.1.1" data-path="basic-website-usage.html"><a href="basic-website-usage.html#verifying-first-that-you-do-not-already-have-an-account"><i class="fa fa-check"></i><b>1.1.1</b> Verifying first that you do not already have an account</a></li>
85 <li class="chapter" data-level="1.1.2" data-path="basic-website-usage.html"><a href="basic-website-usage.html#creating-a-user-account-1"><i class="fa fa-check"></i><b>1.1.2</b> Creating a user account</a></li>
86 </ul></li>
87 <li class="chapter" data-level="1.2" data-path="basic-website-usage.html"><a href="basic-website-usage.html#managing-your-account"><i class="fa fa-check"></i><b>1.2</b> Managing your Account</a>
88 <ul>
89 <li class="chapter" data-level="1.2.1" data-path="basic-website-usage.html"><a href="basic-website-usage.html#login"><i class="fa fa-check"></i><b>1.2.1</b> Login</a></li>
90 <li class="chapter" data-level="1.2.2" data-path="basic-website-usage.html"><a href="basic-website-usage.html#editing-account-settings"><i class="fa fa-check"></i><b>1.2.2</b> Editing Account Settings</a></li>
91 <li class="chapter" data-level="1.2.3" data-path="basic-website-usage.html"><a href="basic-website-usage.html#changing-your-account-status-from-user-to-submitter"><i class="fa fa-check"></i><b>1.2.3</b> Changing Your Account Status: From “User” to “Submitter”</a></li>
92 <li class="chapter" data-level="1.2.4" data-path="basic-website-usage.html"><a href="basic-website-usage.html#submitting-feedback-on-an-sgn-database"><i class="fa fa-check"></i><b>1.2.4</b> Submitting Feedback on an SGN Database</a></li>
93 </ul></li>
94 <li class="chapter" data-level="1.3" data-path="basic-website-usage.html"><a href="basic-website-usage.html#menu-layout"><i class="fa fa-check"></i><b>1.3</b> Menu Layout</a>
95 <ul>
96 <li class="chapter" data-level="1.3.1" data-path="basic-website-usage.html"><a href="basic-website-usage.html#menu-options"><i class="fa fa-check"></i><b>1.3.1</b> Menu Options</a></li>
97 </ul></li>
98 <li class="chapter" data-level="1.4" data-path="basic-website-usage.html"><a href="basic-website-usage.html#working-with-lists"><i class="fa fa-check"></i><b>1.4</b> Working with Lists</a>
99 <ul>
100 <li class="chapter" data-level="1.4.1" data-path="basic-website-usage.html"><a href="basic-website-usage.html#creating-lists"><i class="fa fa-check"></i><b>1.4.1</b> Creating lists</a></li>
101 <li class="chapter" data-level="1.4.2" data-path="basic-website-usage.html"><a href="basic-website-usage.html#viewing-and-editing-lists"><i class="fa fa-check"></i><b>1.4.2</b> Viewing and editing lists</a></li>
102 </ul></li>
103 <li class="chapter" data-level="1.5" data-path="basic-website-usage.html"><a href="basic-website-usage.html#user-permissions"><i class="fa fa-check"></i><b>1.5</b> User Permissions</a></li>
104 </ul></li>
105 <li class="chapter" data-level="2" data-path="searching-the-database.html"><a href="searching-the-database.html"><i class="fa fa-check"></i><b>2</b> Searching the Database</a>
106 <ul>
107 <li class="chapter" data-level="2.1" data-path="searching-the-database.html"><a href="searching-the-database.html#search-wizard"><i class="fa fa-check"></i><b>2.1</b> The Search Wizard</a>
108 <ul>
109 <li class="chapter" data-level="2.1.1" data-path="searching-the-database.html"><a href="searching-the-database.html#how-the-search-wizard-works"><i class="fa fa-check"></i><b>2.1.1</b> How the Search Wizard Works</a></li>
110 <li class="chapter" data-level="2.1.2" data-path="searching-the-database.html"><a href="searching-the-database.html#how-to-use-retrieved-data"><i class="fa fa-check"></i><b>2.1.2</b> How to use retrieved data</a></li>
111 <li class="chapter" data-level="2.1.3" data-path="searching-the-database.html"><a href="searching-the-database.html#updating-the-wizard"><i class="fa fa-check"></i><b>2.1.3</b> Updating the Wizard</a></li>
112 </ul></li>
113 <li class="chapter" data-level="2.2" data-path="searching-the-database.html"><a href="searching-the-database.html#accessions-and-plot-search"><i class="fa fa-check"></i><b>2.2</b> Accessions and Plot Search</a></li>
114 <li class="chapter" data-level="2.3" data-path="searching-the-database.html"><a href="searching-the-database.html#trials-search"><i class="fa fa-check"></i><b>2.3</b> Trials Search</a></li>
115 <li class="chapter" data-level="2.4" data-path="searching-the-database.html"><a href="searching-the-database.html#trait-search"><i class="fa fa-check"></i><b>2.4</b> Trait Search</a></li>
116 <li class="chapter" data-level="2.5" data-path="searching-the-database.html"><a href="searching-the-database.html#ontology-browser"><i class="fa fa-check"></i><b>2.5</b> Ontology Browser</a></li>
117 <li class="chapter" data-level="2.6" data-path="searching-the-database.html"><a href="searching-the-database.html#search-seedlots"><i class="fa fa-check"></i><b>2.6</b> Search Seedlots</a></li>
118 </ul></li>
119 <li class="chapter" data-level="3" data-path="managing-user-roles.html"><a href="managing-user-roles.html"><i class="fa fa-check"></i><b>3</b> Managing User Roles</a>
120 <ul>
121 <li class="chapter" data-level="3.1" data-path="managing-user-roles.html"><a href="managing-user-roles.html#what-are-user-roles"><i class="fa fa-check"></i><b>3.1</b> What are User Roles?</a></li>
122 <li class="chapter" data-level="3.2" data-path="managing-user-roles.html"><a href="managing-user-roles.html#the-manage-user-roles-page"><i class="fa fa-check"></i><b>3.2</b> The Manage User Roles page</a></li>
123 </ul></li>
124 <li class="chapter" data-level="4" data-path="managing-breeding-programs.html"><a href="managing-breeding-programs.html"><i class="fa fa-check"></i><b>4</b> Managing Breeding Programs</a></li>
125 <li class="chapter" data-level="5" data-path="managing-locations.html"><a href="managing-locations.html"><i class="fa fa-check"></i><b>5</b> Managing Locations</a></li>
126 <li class="chapter" data-level="6" data-path="managing-accessions.html"><a href="managing-accessions.html"><i class="fa fa-check"></i><b>6</b> Managing Accessions</a>
127 <ul>
128 <li class="chapter" data-level="6.1" data-path="managing-accessions.html"><a href="managing-accessions.html#add-accessions-using-a-list"><i class="fa fa-check"></i><b>6.1</b> Add Accessions Using A List</a></li>
129 <li class="chapter" data-level="6.2" data-path="managing-accessions.html"><a href="managing-accessions.html#uploading-accessions-and-accessions-info-from-a-file"><i class="fa fa-check"></i><b>6.2</b> Uploading Accessions and Accession’s Info From A File</a></li>
130 <li class="chapter" data-level="6.3" data-path="managing-accessions.html"><a href="managing-accessions.html#email-alert-for-accession-upload"><i class="fa fa-check"></i><b>6.3</b> Email alert for accession upload</a></li>
131 <li class="chapter" data-level="6.4" data-path="managing-accessions.html"><a href="managing-accessions.html#add-parentage-pedigree-information-to-accessions"><i class="fa fa-check"></i><b>6.4</b> Add Parentage (Pedigree) Information to Accessions</a></li>
132 <li class="chapter" data-level="6.5" data-path="managing-accessions.html"><a href="managing-accessions.html#working-with-grafts"><i class="fa fa-check"></i><b>6.5</b> Working with grafts</a></li>
133 <li class="chapter" data-level="6.6" data-path="managing-accessions.html"><a href="managing-accessions.html#bulk-renaming-of-accessions"><i class="fa fa-check"></i><b>6.6</b> Bulk renaming of accessions</a></li>
134 </ul></li>
135 <li class="chapter" data-level="7" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html"><i class="fa fa-check"></i><b>7</b> Managing Seed Lots</a>
136 <ul>
137 <li class="chapter" data-level="7.1" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#add-new-seedlots"><i class="fa fa-check"></i><b>7.1</b> Add New Seedlot(s)</a></li>
138 <li class="chapter" data-level="7.2" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#seedlot-transactions"><i class="fa fa-check"></i><b>7.2</b> Seedlot Transactions</a></li>
139 <li class="chapter" data-level="7.3" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#seed-inventory"><i class="fa fa-check"></i><b>7.3</b> Seed Inventory</a></li>
140 <li class="chapter" data-level="7.4" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#find-seedlots-for-a-list-of-accessions"><i class="fa fa-check"></i><b>7.4</b> Find Seedlots For a List of Accessions</a></li>
141 <li class="chapter" data-level="7.5" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#create-a-seedlot-for-an-accession-or-cross"><i class="fa fa-check"></i><b>7.5</b> Create a seedlot for an Accession or Cross</a></li>
142 <li class="chapter" data-level="7.6" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#add-quality-data-to-a-seedlot"><i class="fa fa-check"></i><b>7.6</b> Add quality data to a seedlot</a></li>
143 <li class="chapter" data-level="7.7" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#seedlot-maintenance-events"><i class="fa fa-check"></i><b>7.7</b> Seedlot Maintenance Events</a>
144 <ul>
145 <li class="chapter" data-level="7.7.1" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#setup"><i class="fa fa-check"></i><b>7.7.1</b> Setup</a></li>
146 <li class="chapter" data-level="7.7.2" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#adding-events"><i class="fa fa-check"></i><b>7.7.2</b> Adding Events</a></li>
147 <li class="chapter" data-level="7.7.3" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#displaying-events"><i class="fa fa-check"></i><b>7.7.3</b> Displaying Events</a></li>
148 <li class="chapter" data-level="7.7.4" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#downloading-events"><i class="fa fa-check"></i><b>7.7.4</b> Downloading Events</a></li>
149 </ul></li>
150 <li class="chapter" data-level="7.8" data-path="managing-seed-lots.html"><a href="managing-seed-lots.html#deleting-seedlots"><i class="fa fa-check"></i><b>7.8</b> Deleting Seedlots</a></li>
151 </ul></li>
152 <li class="chapter" data-level="8" data-path="managing-populations.html"><a href="managing-populations.html"><i class="fa fa-check"></i><b>8</b> Managing Populations</a></li>
153 <li class="chapter" data-level="9" data-path="managing-crosses.html"><a href="managing-crosses.html"><i class="fa fa-check"></i><b>9</b> Managing Crosses</a>
154 <ul>
155 <li class="chapter" data-level="9.1" data-path="managing-crosses.html"><a href="managing-crosses.html#crossing-experiment"><i class="fa fa-check"></i><b>9.1</b> Crossing Experiment</a>
156 <ul>
157 <li class="chapter" data-level="9.1.1" data-path="managing-crosses.html"><a href="managing-crosses.html#add-new-crossing-experiment"><i class="fa fa-check"></i><b>9.1.1</b> Add New Crossing Experiment</a></li>
158 </ul></li>
159 <li class="chapter" data-level="9.2" data-path="managing-crosses.html"><a href="managing-crosses.html#cross"><i class="fa fa-check"></i><b>9.2</b> Cross</a>
160 <ul>
161 <li class="chapter" data-level="9.2.1" data-path="managing-crosses.html"><a href="managing-crosses.html#add-new-crosses"><i class="fa fa-check"></i><b>9.2.1</b> Add New Crosses</a></li>
162 <li class="chapter" data-level="9.2.2" data-path="managing-crosses.html"><a href="managing-crosses.html#update-crosses-by-uploading"><i class="fa fa-check"></i><b>9.2.2</b> Update Crosses by Uploading</a></li>
163 </ul></li>
164 <li class="chapter" data-level="9.3" data-path="managing-crosses.html"><a href="managing-crosses.html#cross-wishlist"><i class="fa fa-check"></i><b>9.3</b> Cross Wishlist</a>
165 <ul>
166 <li class="chapter" data-level="9.3.1" data-path="managing-crosses.html"><a href="managing-crosses.html#create-a-cross-wishlist"><i class="fa fa-check"></i><b>9.3.1</b> Create a Cross Wishlist</a></li>
167 </ul></li>
168 <li class="chapter" data-level="9.4" data-path="managing-crosses.html"><a href="managing-crosses.html#crossing-experiment-detail-page"><i class="fa fa-check"></i><b>9.4</b> Crossing Experiment Detail Page</a></li>
169 <li class="chapter" data-level="9.5" data-path="managing-crosses.html"><a href="managing-crosses.html#cross-detail-page"><i class="fa fa-check"></i><b>9.5</b> Cross Detail Page</a></li>
170 </ul></li>
171 <li class="chapter" data-level="10" data-path="managing-field-trials.html"><a href="managing-field-trials.html"><i class="fa fa-check"></i><b>10</b> Managing Field Trials</a>
172 <ul>
173 <li class="chapter" data-level="10.1" data-path="managing-field-trials.html"><a href="managing-field-trials.html#trial-detail-page"><i class="fa fa-check"></i><b>10.1</b> Trial Detail Page</a></li>
174 <li class="chapter" data-level="10.2" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-trials"><i class="fa fa-check"></i><b>10.2</b> Adding Trials</a>
175 <ul>
176 <li class="chapter" data-level="10.2.1" data-path="managing-field-trials.html"><a href="managing-field-trials.html#prerequisites"><i class="fa fa-check"></i><b>10.2.1</b> Prerequisites</a></li>
177 <li class="chapter" data-level="10.2.2" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-a-trial-by-using-add-trial-form"><i class="fa fa-check"></i><b>10.2.2</b> Adding a trial by using “Add Trial” form</a></li>
178 <li class="chapter" data-level="10.2.3" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-a-trial-from-an-uploaded-file"><i class="fa fa-check"></i><b>10.2.3</b> Adding a trial from an uploaded file</a></li>
179 <li class="chapter" data-level="10.2.4" data-path="managing-field-trials.html"><a href="managing-field-trials.html#multi-location-trials"><i class="fa fa-check"></i><b>10.2.4</b> Multi-location trials</a></li>
180 <li class="chapter" data-level="10.2.5" data-path="managing-field-trials.html"><a href="managing-field-trials.html#email-alert-for-multiple-trial-design-upload"><i class="fa fa-check"></i><b>10.2.5</b> Email alert for multiple trial design upload</a></li>
181 <li class="chapter" data-level="10.2.6" data-path="managing-field-trials.html"><a href="managing-field-trials.html#viewing-plot-layout-and-trait-heatmap"><i class="fa fa-check"></i><b>10.2.6</b> Viewing Plot Layout and Trait HeatMap</a></li>
182 <li class="chapter" data-level="10.2.7" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-additional-information-in-the-trial-detail-page"><i class="fa fa-check"></i><b>10.2.7</b> Adding additional information in the “Trial Detail” page</a></li>
183 <li class="chapter" data-level="10.2.8" data-path="managing-field-trials.html"><a href="managing-field-trials.html#downloading-the-trial-layout-from-the-trial-detail-page"><i class="fa fa-check"></i><b>10.2.8</b> Downloading the Trial Layout from the “Trial Detail” page</a></li>
184 <li class="chapter" data-level="10.2.9" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-plant-entries-to-your-trial"><i class="fa fa-check"></i><b>10.2.9</b> Adding Plant Entries To Your Trial</a></li>
185 <li class="chapter" data-level="10.2.10" data-path="managing-field-trials.html"><a href="managing-field-trials.html#adding-tissue-sample-entries-to-your-trial"><i class="fa fa-check"></i><b>10.2.10</b> Adding Tissue Sample Entries To Your Trial</a></li>
186 <li class="chapter" data-level="10.2.11" data-path="managing-field-trials.html"><a href="managing-field-trials.html#uploading-gps-coordinates-for-plots"><i class="fa fa-check"></i><b>10.2.11</b> Uploading GPS Coordinates For Plots</a></li>
187 <li class="chapter" data-level="10.2.12" data-path="managing-field-trials.html"><a href="managing-field-trials.html#uploading-additional-files-to-trial"><i class="fa fa-check"></i><b>10.2.12</b> Uploading Additional Files To Trial</a></li>
188 <li class="chapter" data-level="10.2.13" data-path="managing-field-trials.html"><a href="managing-field-trials.html#deleting-trial-data"><i class="fa fa-check"></i><b>10.2.13</b> Deleting Trial Data</a></li>
189 </ul></li>
190 </ul></li>
191 <li class="chapter" data-level="11" data-path="managing-genotyping-plates.html"><a href="managing-genotyping-plates.html"><i class="fa fa-check"></i><b>11</b> Managing Genotyping Plates</a>
192 <ul>
193 <li class="chapter" data-level="11.1" data-path="managing-genotyping-plates.html"><a href="managing-genotyping-plates.html#adding-a-new-genotyping-plate"><i class="fa fa-check"></i><b>11.1</b> Adding a New Genotyping Plate</a></li>
194 <li class="chapter" data-level="11.2" data-path="managing-genotyping-plates.html"><a href="managing-genotyping-plates.html#genotyping-plate-detail-page"><i class="fa fa-check"></i><b>11.2</b> Genotyping Plate Detail Page</a></li>
195 </ul></li>
196 <li class="chapter" data-level="12" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html"><i class="fa fa-check"></i><b>12</b> Using Field Book App</a>
197 <ul>
198 <li class="chapter" data-level="12.1" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#a-typical-workflow"><i class="fa fa-check"></i><b>12.1</b> A typical workflow</a></li>
199 <li class="chapter" data-level="12.2" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-layout-files"><i class="fa fa-check"></i><b>12.2</b> Creating Field Layout Files for the Field Book App</a>
200 <ul>
201 <li class="chapter" data-level="12.2.1" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-field-layout-files-by-using-field-book-tools-page."><i class="fa fa-check"></i><b>12.2.1</b> Creating “Field Layout Files” by using “Field Book Tools” page.</a></li>
202 <li class="chapter" data-level="12.2.2" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-field-layout-files-by-using-trial-detail-page."><i class="fa fa-check"></i><b>12.2.2</b> Creating “Field Layout Files” by using “Trial Detail” page.</a></li>
203 </ul></li>
204 <li class="chapter" data-level="12.3" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-trait-files"><i class="fa fa-check"></i><b>12.3</b> Creating Trait Files for the Field Book App</a>
205 <ul>
206 <li class="chapter" data-level="12.3.1" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-a-trait-list"><i class="fa fa-check"></i><b>12.3.1</b> Creating a Trait List</a></li>
207 <li class="chapter" data-level="12.3.2" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#creating-a-trait-file"><i class="fa fa-check"></i><b>12.3.2</b> Creating a Trait File</a></li>
208 </ul></li>
209 <li class="chapter" data-level="12.4" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#transferring-files-from-your-computer-to-android-tablet"><i class="fa fa-check"></i><b>12.4</b> Transferring Files from Your Computer to Android Tablet</a>
210 <ul>
211 <li class="chapter" data-level="12.4.1" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#files-on-your-computer"><i class="fa fa-check"></i><b>12.4.1</b> Files on your computer</a></li>
212 <li class="chapter" data-level="12.4.2" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#files-on-your-android-tablet"><i class="fa fa-check"></i><b>12.4.2</b> Files on your Android tablet</a></li>
213 </ul></li>
214 <li class="chapter" data-level="12.5" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#setting-up-field-book-app-for-data-collection"><i class="fa fa-check"></i><b>12.5</b> Setting up “Field Book App” for data collection</a></li>
215 <li class="chapter" data-level="12.6" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#exporting-files-from-field-book-app"><i class="fa fa-check"></i><b>12.6</b> Exporting Files from Field Book App</a></li>
216 <li class="chapter" data-level="12.7" data-path="using-fieldbook-app.html"><a href="using-fieldbook-app.html#uploading-pheno-files"><i class="fa fa-check"></i><b>12.7</b> Uploading Phenotype Files to an SGN database</a></li>
217 </ul></li>
218 <li class="chapter" data-level="13" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html"><i class="fa fa-check"></i><b>13</b> Managing Phenotypic Data</a>
219 <ul>
220 <li class="chapter" data-level="13.1" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#uploading-fieldbook-phenotypes"><i class="fa fa-check"></i><b>13.1</b> Uploading Fieldbook Phenotypes</a>
221 <ul>
222 <li class="chapter" data-level="13.1.1" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#export-field-book-database-file"><i class="fa fa-check"></i><b>13.1.1</b> Export Field Book Database File</a></li>
223 <li class="chapter" data-level="13.1.2" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#upload-field-book-database-file"><i class="fa fa-check"></i><b>13.1.2</b> Upload Field Book Database File</a></li>
224 </ul></li>
225 <li class="chapter" data-level="13.2" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#uploading-spreadsheet-phenotypes"><i class="fa fa-check"></i><b>13.2</b> Uploading Spreadsheet Phenotypes</a>
226 <ul>
227 <li class="chapter" data-level="13.2.1" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#generating-spreadsheet-file"><i class="fa fa-check"></i><b>13.2.1</b> Generating Spreadsheet File</a></li>
228 <li class="chapter" data-level="13.2.2" data-path="managing-phenotypic-data.html"><a href="managing-phenotypic-data.html#uploading-spreadsheet-file"><i class="fa fa-check"></i><b>13.2.2</b> Uploading Spreadsheet File</a></li>
229 </ul></li>
230 </ul></li>
231 <li class="chapter" data-level="14" data-path="managing-barcodes.html"><a href="managing-barcodes.html"><i class="fa fa-check"></i><b>14</b> Managing Barcodes</a></li>
232 <li class="chapter" data-level="15" data-path="using-the-label-designer.html"><a href="using-the-label-designer.html"><i class="fa fa-check"></i><b>15</b> Using the Label Designer</a>
233 <ul>
234 <li class="chapter" data-level="15.0.1" data-path="using-the-label-designer.html"><a href="using-the-label-designer.html#first-select-a-datasource"><i class="fa fa-check"></i><b>15.0.1</b> First Select a Datasource</a></li>
235 <li class="chapter" data-level="15.0.2" data-path="using-the-label-designer.html"><a href="using-the-label-designer.html#set-page-and-label-size"><i class="fa fa-check"></i><b>15.0.2</b> Set Page and Label Size</a></li>
236 <li class="chapter" data-level="15.0.3" data-path="using-the-label-designer.html"><a href="using-the-label-designer.html#design-your-label"><i class="fa fa-check"></i><b>15.0.3</b> Design Your Label</a></li>
237 <li class="chapter" data-level="15.0.4" data-path="using-the-label-designer.html"><a href="using-the-label-designer.html#adjust-formatting-save-and-download"><i class="fa fa-check"></i><b>15.0.4</b> Adjust Formatting, Save, and Download</a></li>
238 </ul></li>
239 <li class="chapter" data-level="16" data-path="managing-downloads.html"><a href="managing-downloads.html"><i class="fa fa-check"></i><b>16</b> Managing Downloads</a></li>
240 <li class="chapter" data-level="17" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html"><i class="fa fa-check"></i><b>17</b> Managing ODK Data Collection</a>
241 <ul>
242 <li class="chapter" data-level="17.1" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html#ona-crossing-information"><i class="fa fa-check"></i><b>17.1</b> ONA Crossing Information</a>
243 <ul>
244 <li class="chapter" data-level="17.1.1" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html#managing-ona-crossing-information"><i class="fa fa-check"></i><b>17.1.1</b> Managing ONA Crossing Information</a></li>
245 <li class="chapter" data-level="17.1.2" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html#reviewing-plant-status"><i class="fa fa-check"></i><b>17.1.2</b> Reviewing Plant Status</a></li>
246 <li class="chapter" data-level="17.1.3" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html#graphical-summary-for-performed-crosses"><i class="fa fa-check"></i><b>17.1.3</b> Graphical Summary For Performed Crosses</a></li>
247 <li class="chapter" data-level="17.1.4" data-path="managing-odk-data-collection.html"><a href="managing-odk-data-collection.html#summary-information-for-performed-crosses"><i class="fa fa-check"></i><b>17.1.4</b> Summary Information For Performed Crosses</a></li>
248 </ul></li>
249 </ul></li>
250 <li class="chapter" data-level="18" data-path="managing-tissue-samples.html"><a href="managing-tissue-samples.html"><i class="fa fa-check"></i><b>18</b> Managing Tissue Samples</a>
251 <ul>
252 <li class="chapter" data-level="18.1" data-path="managing-tissue-samples.html"><a href="managing-tissue-samples.html#tissue-samples-from-field-trials"><i class="fa fa-check"></i><b>18.1</b> Tissue samples from field trials</a></li>
253 <li class="chapter" data-level="18.2" data-path="managing-tissue-samples.html"><a href="managing-tissue-samples.html#genotyping-plate-tissue-samples-96-or-384-well-plates"><i class="fa fa-check"></i><b>18.2</b> Genotyping Plate Tissue Samples (96 or 384 well plates)</a></li>
254 </ul></li>
255 <li class="chapter" data-level="19" data-path="managing-observation-variables.html"><a href="managing-observation-variables.html"><i class="fa fa-check"></i><b>19</b> Managing Observation Variables</a>
256 <ul>
257 <li class="chapter" data-level="19.1" data-path="managing-observation-variables.html"><a href="managing-observation-variables.html#managing-observation-variables-with-traits-methods-and-scales"><i class="fa fa-check"></i><b>19.1</b> Managing Observation Variables with Traits, Methods, and Scales</a></li>
258 </ul></li>
259 <li class="chapter" data-level="20" data-path="managing-image-data.html"><a href="managing-image-data.html"><i class="fa fa-check"></i><b>20</b> Managing Image Data</a>
260 <ul>
261 <li class="chapter" data-level="20.1" data-path="managing-image-data.html"><a href="managing-image-data.html#image-phenotyping-dashboard"><i class="fa fa-check"></i><b>20.1</b> Image-Phenotyping Dashboard</a></li>
262 <li class="chapter" data-level="20.2" data-path="managing-image-data.html"><a href="managing-image-data.html#image-input"><i class="fa fa-check"></i><b>20.2</b> Image Input</a></li>
263 <li class="chapter" data-level="20.3" data-path="managing-image-data.html"><a href="managing-image-data.html#standard-process"><i class="fa fa-check"></i><b>20.3</b> Standard Process</a></li>
264 <li class="chapter" data-level="20.4" data-path="managing-image-data.html"><a href="managing-image-data.html#ground-control-points"><i class="fa fa-check"></i><b>20.4</b> Ground Control Points</a></li>
265 </ul></li>
266 <li class="chapter" data-level="21" data-path="managing-vcf-data.html"><a href="managing-vcf-data.html"><i class="fa fa-check"></i><b>21</b> Managing VCF Data</a>
267 <ul>
268 <li class="chapter" data-level="21.1" data-path="managing-vcf-data.html"><a href="managing-vcf-data.html#uploading-vcf-data"><i class="fa fa-check"></i><b>21.1</b> Uploading VCF Data</a></li>
269 <li class="chapter" data-level="21.2" data-path="managing-vcf-data.html"><a href="managing-vcf-data.html#searching-and-downloading-vcf-data"><i class="fa fa-check"></i><b>21.2</b> Searching and Downloading VCF Data</a></li>
270 <li class="chapter" data-level="21.3" data-path="managing-vcf-data.html"><a href="managing-vcf-data.html#searching-protocols"><i class="fa fa-check"></i><b>21.3</b> Searching Protocols</a></li>
271 <li class="chapter" data-level="21.4" data-path="managing-vcf-data.html"><a href="managing-vcf-data.html#detail-pages-and-deletion"><i class="fa fa-check"></i><b>21.4</b> Detail Pages and Deletion</a></li>
272 </ul></li>
273 <li class="chapter" data-level="22" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html"><i class="fa fa-check"></i><b>22</b> Managing Spectral Data</a>
274 <ul>
275 <li class="chapter" data-level="22.1" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html#upload-spectral-data"><i class="fa fa-check"></i><b>22.1</b> Upload Spectral Data</a></li>
276 <li class="chapter" data-level="22.2" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html#evaluate-and-remove-outliers"><i class="fa fa-check"></i><b>22.2</b> Evaluate and Remove Outliers</a></li>
277 <li class="chapter" data-level="22.3" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html#plot-spectra"><i class="fa fa-check"></i><b>22.3</b> Plot Spectra</a></li>
278 <li class="chapter" data-level="22.4" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html#aggregate-spectra"><i class="fa fa-check"></i><b>22.4</b> Aggregate Spectra</a></li>
279 <li class="chapter" data-level="22.5" data-path="managing-spectral-data.html"><a href="managing-spectral-data.html#references"><i class="fa fa-check"></i><b>22.5</b> References</a></li>
280 </ul></li>
281 <li class="chapter" data-level="23" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html"><i class="fa fa-check"></i><b>23</b> Managing Sequence Metadata</a>
282 <ul>
283 <li class="chapter" data-level="23.1" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#what-is-sequence-metadata"><i class="fa fa-check"></i><b>23.1</b> What is Sequence Metadata?</a></li>
284 <li class="chapter" data-level="23.2" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#loading-sequence-metadata"><i class="fa fa-check"></i><b>23.2</b> Loading Sequence Metadata</a></li>
285 <li class="chapter" data-level="23.3" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#searching-sequence-metadata"><i class="fa fa-check"></i><b>23.3</b> Searching Sequence Metadata</a>
286 <ul>
287 <li class="chapter" data-level="23.3.1" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#basic-search"><i class="fa fa-check"></i><b>23.3.1</b> Basic Search</a></li>
288 <li class="chapter" data-level="23.3.2" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#advanced-search"><i class="fa fa-check"></i><b>23.3.2</b> Advanced Search</a></li>
289 </ul></li>
290 <li class="chapter" data-level="23.4" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#marker-integration"><i class="fa fa-check"></i><b>23.4</b> Marker Integration</a></li>
291 <li class="chapter" data-level="23.5" data-path="managing-sequence-metadata.html"><a href="managing-sequence-metadata.html#sequence-metadata-api"><i class="fa fa-check"></i><b>23.5</b> Sequence Metadata API</a></li>
292 </ul></li>
293 <li class="chapter" data-level="24" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html"><i class="fa fa-check"></i><b>24</b> Managing Outliers in Dataset</a>
294 <ul>
295 <li class="chapter" data-level="24.1" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#what-is-outliers-functionality-in-dataset"><i class="fa fa-check"></i><b>24.1</b> What is Outliers Functionality in Dataset ?</a></li>
296 <li class="chapter" data-level="24.2" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#accessing-trait-visualization"><i class="fa fa-check"></i><b>24.2</b> Accessing Trait Visualization</a></li>
297 <li class="chapter" data-level="24.3" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#interpreting-visual-elements"><i class="fa fa-check"></i><b>24.3</b> Interpreting Visual Elements</a></li>
298 <li class="chapter" data-level="24.4" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#choosing-cut-off-values"><i class="fa fa-check"></i><b>24.4</b> Choosing Cut-Off Values</a></li>
299 <li class="chapter" data-level="24.5" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#setting-deviation-multiplier"><i class="fa fa-check"></i><b>24.5</b> Setting Deviation Multiplier</a></li>
300 <li class="chapter" data-level="24.6" data-path="managing-outliers-in-dataset.html"><a href="managing-outliers-in-dataset.html#utilizing-graph-controls"><i class="fa fa-check"></i><b>24.6</b> Utilizing Graph Controls</a></li>
301 </ul></li>
302 <li class="chapter" data-level="25" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html"><i class="fa fa-check"></i><b>25</b> Data Analysis Tools</a>
303 <ul>
304 <li class="chapter" data-level="25.1" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#selection-index"><i class="fa fa-check"></i><b>25.1</b> Selection Index</a></li>
305 <li class="chapter" data-level="25.2" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#genomic-selection"><i class="fa fa-check"></i><b>25.2</b> Genomic Selection</a>
306 <ul>
307 <li class="chapter" data-level="25.2.1" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#method-1"><i class="fa fa-check"></i><b>25.2.1</b> Building a Model - Method 1:</a></li>
308 <li class="chapter" data-level="25.2.2" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#method-2"><i class="fa fa-check"></i><b>25.2.2</b> Building a Model - Method 2</a></li>
309 <li class="chapter" data-level="25.2.3" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#building-a-model---method-3"><i class="fa fa-check"></i><b>25.2.3</b> Building a Model - Method 3</a></li>
310 </ul></li>
311 <li class="chapter" data-level="25.3" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#genome-browsing"><i class="fa fa-check"></i><b>25.3</b> Genome Browsing</a>
312 <ul>
313 <li class="chapter" data-level="25.3.1" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#browsing-genotype-data-by-accession"><i class="fa fa-check"></i><b>25.3.1</b> Browsing Genotype data by Accession</a></li>
314 <li class="chapter" data-level="25.3.2" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#browsing-genotype-data-by-trial"><i class="fa fa-check"></i><b>25.3.2</b> Browsing Genotype data by Trial</a></li>
315 </ul></li>
316 <li class="chapter" data-level="25.4" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#principal-component-analysis-pca"><i class="fa fa-check"></i><b>25.4</b> Principal Component Analysis (PCA)</a></li>
317 <li class="chapter" data-level="25.5" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#anova"><i class="fa fa-check"></i><b>25.5</b> ANOVA</a></li>
318 <li class="chapter" data-level="25.6" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#clustering-k-means-hierarchical"><i class="fa fa-check"></i><b>25.6</b> Clustering (K-Means, Hierarchical)</a></li>
319 <li class="chapter" data-level="25.7" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#genetic-gain"><i class="fa fa-check"></i><b>25.7</b> Genetic Gain</a></li>
320 <li class="chapter" data-level="25.8" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#kinship-and-inbreeding-coefficients"><i class="fa fa-check"></i><b>25.8</b> Kinship and Inbreeding Coefficients</a></li>
321 <li class="chapter" data-level="25.9" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#creating-crossing-groups"><i class="fa fa-check"></i><b>25.9</b> Creating Crossing Groups</a></li>
322 <li class="chapter" data-level="25.10" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#search-wizard-genomic-relationship-matrix-grm-download"><i class="fa fa-check"></i><b>25.10</b> Search Wizard Genomic Relationship Matrix (GRM) Download</a></li>
323 <li class="chapter" data-level="25.11" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#search-wizard-genome-wide-association-study-gwas"><i class="fa fa-check"></i><b>25.11</b> Search Wizard Genome Wide Association Study (GWAS)</a></li>
324 <li class="chapter" data-level="25.12" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#spectral-analysis"><i class="fa fa-check"></i><b>25.12</b> Spectral Analysis</a>
325 <ul>
326 <li class="chapter" data-level="25.12.1" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#dataset-selection"><i class="fa fa-check"></i><b>25.12.1</b> Dataset selection</a></li>
327 <li class="chapter" data-level="25.12.2" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#cross-validation"><i class="fa fa-check"></i><b>25.12.2</b> Cross-validation</a></li>
328 <li class="chapter" data-level="25.12.3" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#preprocessing"><i class="fa fa-check"></i><b>25.12.3</b> Preprocessing</a></li>
329 <li class="chapter" data-level="25.12.4" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#algorithms"><i class="fa fa-check"></i><b>25.12.4</b> Algorithms</a></li>
330 <li class="chapter" data-level="25.12.5" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#output-common-model-summary-statistics"><i class="fa fa-check"></i><b>25.12.5</b> Output: common model summary statistics</a></li>
331 <li class="chapter" data-level="25.12.6" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#export-model-for-later-use"><i class="fa fa-check"></i><b>25.12.6</b> Export model for later use</a></li>
332 <li class="chapter" data-level="25.12.7" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#predict-phenotypes-from-an-exported-model-routine-use"><i class="fa fa-check"></i><b>25.12.7</b> Predict phenotypes from an exported model (routine use)</a></li>
333 <li class="chapter" data-level="25.12.8" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#faq"><i class="fa fa-check"></i><b>25.12.8</b> FAQ</a></li>
334 </ul></li>
335 <li class="chapter" data-level="25.13" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#general-mixed-model-tool"><i class="fa fa-check"></i><b>25.13</b> General Mixed Model Tool</a></li>
336 <li class="chapter" data-level="25.14" data-path="data-analysis-tools.html"><a href="data-analysis-tools.html#genomic-prediction-of-cross-performance-gcpc"><i class="fa fa-check"></i><b>25.14</b> Genomic Prediction of Cross Performance (GCPC)</a></li>
337 </ul></li>
338 <li class="divider"></li>
339 <li><a href="https://bookdown.org/" target="blank">Published with bookdown</a></li>
340 </ul>
342 </nav>
343 </div>
345 <div class="book-body">
346 <div class="body-inner">
347 <div class="book-header" role="navigation">
348 <h1>
349 <i class="fa fa-circle-o-notch fa-spin"></i><a href="./">User Manual of Breedbase</a>
350 </h1>
351 </div>
353 <div class="page-wrapper" tabindex="-1" role="main">
354 <div class="page-inner">
356 <section class="normal" id="section-">
357 <div id="data-analysis-tools" class="section level1 hasAnchor" number="25">
358 <h1><span class="header-section-number">Chapter 25</span> Data Analysis Tools<a href="data-analysis-tools.html#data-analysis-tools" class="anchor-section" aria-label="Anchor link to header"></a></h1>
359 <p>SGN databases provides several tools for phenotype data analysis, marker-assisted selection, sequence and expression analyses, as well as ontology browser. These tools can be found in the “Analyze” menu.</p>
360 <p><img src="assets/images/image114.png" width="95%" style="display: block; margin: auto;" /></p>
361 <div id="selection-index" class="section level2 hasAnchor" number="25.1">
362 <h2><span class="header-section-number">25.1</span> Selection Index<a href="data-analysis-tools.html#selection-index" class="anchor-section" aria-label="Anchor link to header"></a></h2>
363 <p>To determine rankings of accessions based on more than one desirable trait, SGN databases provide a “Selection Index” tool that allows you to specify a weighting on each trait. To access the tool, clicking on “Selection Index” in the “Analyze” menu.</p>
364 <p><img src="assets/images/image251.png" width="95%" style="display: block; margin: auto;" /></p>
365 <p>On the Selection Index page, selecting a trial that you want to analyze.</p>
366 <p><img src="assets/images/image95.png" width="95%" style="display: block; margin: auto;" /></p>
367 <p>After you selected a trial, you can find traits that were assayed in that trial in the “Trait” box.</p>
368 <p><img src="assets/images/image78.png" width="95%" style="display: block; margin: auto;" /></p>
369 <p>Selecting a trait that you want to include in the analysis will open a new dialogue showing the selected trait and a box that you can assign a “Weight” of that trait. After you are done, you can continue by selecting another trait by clicking on “Add another trait” link.</p>
370 <p><img src="assets/images/image304.png" width="95%" style="display: block; margin: auto;" /></p>
371 <p>After you selected another trait, this page will automatically update information for you by showing all of the traits that you selected for the analysis.</p>
372 <p><img src="assets/images/image76.png" width="95%" style="display: block; margin: auto;" /></p>
373 <p>You also have options to choose a reference accession, choose to include accessions with missing phenotypes, scaling values to a reference accession. After you complete your setting, clicking on “Calculate Rankings”</p>
374 <p><img src="assets/images/image343.png" width="95%" style="display: block; margin: auto;" /></p>
375 <p>The Selection Index tool will generate rankings of accessions based on the information that you specified. You can copy the results to your system clipboard, convert the table data to CSV format, or print the data.</p>
376 <p><img src="assets/images/image326.png" width="95%" style="display: block; margin: auto;" /></p>
377 <p>Clicking on “Raw Average” will display average values of the phenotypes of those ranked accessions.</p>
378 <p><img src="assets/images/image150.png" width="95%" style="display: block; margin: auto;" /></p>
379 <p>Selection Index tool also allows you to save top ranked accessions directly to “Lists”. You can retrieve top ranked accessions by selecting a number or a percent.</p>
380 <p><img src="assets/images/image156.png" width="95%" style="display: block; margin: auto;" /></p>
381 </div>
382 <div id="genomic-selection" class="section level2 hasAnchor" number="25.2">
383 <h2><span class="header-section-number">25.2</span> Genomic Selection<a href="data-analysis-tools.html#genomic-selection" class="anchor-section" aria-label="Anchor link to header"></a></h2>
384 <p>The prediction of breeding values for a trait is a one step or two steps process, depending on what stage in your breeding cycle you are. The first step is to build a prediction model for a trait using a training population of clones with phenotype and genotype data. If you have yet to select parents for crossing for your first cycle of selection you can use the breeding values of the training population. If you are at later stages of your selection program, you need to do the second step which is applying the prediction model on your selection population. All clones in your training and selection populations must exist in the database.</p>
385 <p>To use the genomic selection tool, on <a href="http://cassavabase.org/"><em>cassavabase.org</em></a>, select ‘Genomic Selection’ from the ‘analyze’ pull-down menu.</p>
386 <p><img src="assets/images/image247.png" width="95%" style="display: block; margin: auto;" /></p>
387 <p>There are three ways to build a model for a trait.</p>
388 <div id="method-1" class="section level3 hasAnchor" number="25.2.1">
389 <h3><span class="header-section-number">25.2.1</span> Building a Model - Method 1:<a href="data-analysis-tools.html#method-1" class="anchor-section" aria-label="Anchor link to header"></a></h3>
390 <p>One way to build a model is, using a trait name, to search for trials in which the trait was phenotyped and use a trial or a combination of trials to build a model for the trait. For example, if you search for ’mosaic disease severity, you will get a list of trials you can use as training populations.</p>
391 <p><img src="assets/images/image160.png" width="95%" style="display: block; margin: auto;" /></p>
392 <p>You will get a list of trials (as shown below) in which the trait of your interested was phenotyped. From the list, you can use a single trial as a training population or combine several trails to form a training population for the prediction model of the trait. Let’s say, you want to create a training population using individuals from trials ‘cassava ibadan 2001/02’ and ‘cassava ibadan 02/03’ and build a model for ‘cassava mosaic disease severity’ using all clones from the training population.</p>
393 <p><img src="assets/images/image249.png" width="95%" style="display: block; margin: auto;" /></p>
394 <p>Select the trials to combine (the same coloured), click ‘done selecting’, click the ‘combine trials and build model’ button, and you will get a model and its output for the trait. On the model detail page, you can view the description of input data used in the model, output from the model and search interface for selection populations the model you can apply to predict their breeding values. The description of the input data for the model includes the number of phenotyped clones, and the number of markers, scatter and frequency distribution plots for the phenotype data, relationship between the phenotype data and GEBVs, population structure. The model output includes model parameters, heritability of the trait , prediction accuracy, GEBVs of the individuals from the training population and marker effects.</p>
395 <p><img src="assets/images/image330.png" width="95%" style="display: block; margin: auto;" /></p>
396 <p>Expand each section to see detailed information.</p>
397 <p>If you expand the ‘Trait phenotype data’ section, you will find plots to explore the phenotype data used in the model. You can assess the phenotype data using a scatter and histogram plots and the descriptive statistics.</p>
398 <p><img src="assets/images/image244.png" width="95%" style="display: block; margin: auto;" /></p>
399 <p><img src="assets/images/image263.png" width="95%" style="display: block; margin: auto;" /></p>
400 <p>A regression line between observed phenotypes and GEBVs shows the relationship between the two.</p>
401 <p><img src="assets/images/image83.png" width="95%" style="display: block; margin: auto;" /></p>
402 <p>You can also explore if there is any sub-clustering in the training population using PCA.</p>
403 <p><img src="assets/images/image93.png" width="95%" style="display: block; margin: auto;" /></p>
404 <p>To check the model accuracy, a 10-fold cross-validation test, expand the ‘model accuracy’ section.</p>
405 <p><img src="assets/images/image328.png" width="95%" style="display: block; margin: auto;" /></p>
406 <p>Marker effects are also available for download. To do so, expanad the ‘Marker Effects’ section and click the ‘Download all marker effects’ link and you will get a tab delimited output to save on your computer.</p>
407 <p><img src="assets/images/image74.png" width="95%" style="display: block; margin: auto;" /></p>
408 <p>The breeding values of the individuals used in the training population are displayed graphically. Mousing over each data point displays the clone and its breeding value. To examine better, you can zoom in into the plot by selecting an area on the plot. You can download them also by following the ‘Download all GEBVs’ link.</p>
409 <p><img src="assets/images/image147.png" width="95%" style="display: block; margin: auto;" /></p>
410 <div id="estimating-breeding-values-in-a-selection-population" class="section level4 unnumbered hasAnchor">
411 <h4>Estimating breeding values in a selection population<a href="data-analysis-tools.html#estimating-breeding-values-in-a-selection-population" class="anchor-section" aria-label="Anchor link to header"></a></h4>
412 <p>If you already have a selection population (in the database), from the same model page, you can apply the model to the selection population and estimate breeding values for all the clones in the population. You can search for a selection population of clones in the database using the search interface or you can make a custom list of clones using the <a href="basic-website-usage.html#working-with-lists"><em>list interface</em></a>. If you click the ‘search for all relevant selection populations’, you will see all relevant selection populations for that model. However, this option takes long time decause of the large set of populations in the database and the filtering. Therefore, the fastest way is to search for each of your selection populations by name. If you are logged in to the website you will also see a list of your custom set of genotyped clones.</p>
413 <p><img src="assets/images/image338.png" width="95%" style="display: block; margin: auto;" /></p>
414 <p>To apply the model to a selection population, simply click your population name or ‘Predict Now’ and you will get the predicted breeding values. When you see a name of (or acronym]) of the trait, follow the link and you will see an interactive plot of the breeding values and a link to download the breeding values of your selection population.</p>
415 <p><img src="assets/images/image334.png" width="95%" style="display: block; margin: auto;" /></p>
416 </div>
417 </div>
418 <div id="method-2" class="section level3 hasAnchor" number="25.2.2">
419 <h3><span class="header-section-number">25.2.2</span> Building a Model - Method 2<a href="data-analysis-tools.html#method-2" class="anchor-section" aria-label="Anchor link to header"></a></h3>
420 <p>Another way to build a model is by selecting a trial, instead of selecting and searching for a specific trait. This approach is useful when you know a particular trial that is relevant to the environment you are targeting to breed material for. This method allows you to build models and predict genomic estimated breeding values (GEBVs) for several traits within a single trial at once. You can also calculate selection index for your clones when GEBVs are estimated for multiple traits.</p>
421 <p>To do this select the “Genomic Selection” link found under the “analyze” menu. This will take you to the same home page as used with Method 1. However, instead of entering information to search for in “Search for a trait”, click on “Use a trait as a trial population”. This will expand a new menu that will show all available trials.</p>
422 <p><img src="assets/images/image344.png" width="95%" style="display: block; margin: auto;" /></p>
423 <p><img src="assets/images/image329.png" width="25%" style="display: block; margin: auto;" /></p>
424 <p><img src="assets/images/image341.png" width="95%" style="display: block; margin: auto;" /></p>
425 <p>To begin creating the model, select the existing trial that you would like to use. In this example I will be using the trial and trait data from “Cassava Ibadan 2002/03” trial. Clicking on a trial will take you to a page where you can find information such as number of markers and number of phenotypes clones.</p>
426 <p><img src="assets/images/image322.png" width="95%" style="display: block; margin: auto;" /></p>
427 <p>In addition to the number of phenotype clones and number of markers, the main page for the trial selected also has information and graphs on phenotypic correlation for all of the traits. By moving your cursor over the graph you can read the different values for correlation between two traits. A key with all of the trait names of the acronyms used can be found in the tab below the graph.</p>
428 <p><img src="assets/images/image151.png" width="95%" style="display: block; margin: auto;" /></p>
429 <p>Below the “Training population summary” there is a tab for “Traits”. Clicking on this tab will show all available traits for the specific trial. You can create a model by choosing one or multiple traits in the trial and clicking “Build Model”. In this example, the traits for “cassava bacterial blight severity” and “cassava mosaic disease severity” have been selected.</p>
430 <p><img src="assets/images/image69.png" width="95%" style="display: block; margin: auto;" /></p>
431 <p>Clicking on ‘Build Model’ will take you to a new page with the models outputs for the traits. Under the “Genomic Selection Model Output” tab you can view the model output and the model accuracy. Clicking on any of the traits will take you to a page with information about the model output on that individual trait within the trial. There you can view all of the trait information that was seen in more detail in <a href="data-analysis-tools.html#method-1"><em>Method 1</em></a>.</p>
432 <p><img src="assets/images/image336.png" width="95%" style="display: block; margin: auto;" /></p>
433 <p>You can apply the models to simultaneously predict GEBVs for respective traits in a selection population by clicking on “Predict Now” or the name of the selection population. You can also apply the models to any set of genotyped clones that you can create using the ‘lists’ feature. For more information on lists, click <a href="basic-website-usage.html#working-with-lists"><em>here</em></a>. Follow the link to the trait name to view and download the predicted GEBVs for the trait in a selection population.</p>
434 <p><img src="assets/images/image171.png" width="95%" style="display: block; margin: auto;" /></p>
435 <p>To compare clones based on their performance on multiple traits, you can calculate selection indices using the form below. Choose from the pulldown menu the population with predicted GEBVs for the traits and assign relative weights for each trait. The relative weight of each trait must be between 0 - 1. 0 being of least weight and importance, not wanting to consider that particular trait in selecting a genotype and 1 being a trait that you give highest importance.</p>
436 <p>In this example we will be using the “Cassava Ibadan 2002/03” population and assigning values to each of the traits. Remember that there is a list of acronyms and trait names at the bottom of the page for reference. After entering whatever values you would like for each trait click on the “Calculate” button to generate results. This will create a list of the top 10 genotypes that most closely match the criteria that you entered. The list will be displayed right below the “selection index” tab. This information can also be downloaded onto your computer by clicking on the “Download selection indices” link underneath the listed genotypes and selection indices.</p>
437 <p><img src="assets/images/image81.png" width="95%" style="display: block; margin: auto;" /></p>
438 </div>
439 <div id="building-a-model---method-3" class="section level3 hasAnchor" number="25.2.3">
440 <h3><span class="header-section-number">25.2.3</span> Building a Model - Method 3<a href="data-analysis-tools.html#building-a-model---method-3" class="anchor-section" aria-label="Anchor link to header"></a></h3>
441 <p>In addition to creating a model by searching for pre-existing traits or by preexisting trial name, models can also be created by using your own list of clones. This creates a model by using or creating a training population.</p>
442 <p>The page to use the third Method for creating a population model is the same as for the other two models. Select “Genomic Selection” from under the “analyze” menu of the main toolbar. This will take you to the Genomic Selection homepage and show you all three available methods to create a model. To see and use Method 3 scroll down and click on the tab labeled “Create a Training Population”. This will open a set of tools that will allow you to use pre-existing lists or to create a new list.</p>
443 <p><img src="assets/images/image138.png" width="95%" style="display: block; margin: auto;" /></p>
444 <p>Once the “Create a Training Population” tab is opened you have the option to use a pre-existing list or create new one. To learn how to create a list, click <a href="basic-website-usage.html#working-with-lists"><em>here</em></a>. The “Make a new list of plots” link will take you directly to the Search Wizard that is usually used to create lists.</p>
445 <p>Please note: the only lists that can be used in Method 3 to create a model are lists of plots and trials. If the pre-existing list is not of plots or trials (for example, traits, or locations) it will not show up and cannot be used as a training population. When you create you use a list of trials, the trials data will be combined to create a training data set.</p>
446 <p>To use your custom list of plots or trials as a training population, select the list and click “Go”. This will take you to a detail page for the training population.</p>
447 <p><img src="assets/images/image181.png" width="95%" style="display: block; margin: auto;" /></p>
448 <p>From here on you can build models and predict breeding values as described in <a href="data-analysis-tools.html#method-2"><em>Method 2</em></a><strong>.</strong></p>
449 </div>
450 </div>
451 <div id="genome-browsing" class="section level2 hasAnchor" number="25.3">
452 <h2><span class="header-section-number">25.3</span> Genome Browsing<a href="data-analysis-tools.html#genome-browsing" class="anchor-section" aria-label="Anchor link to header"></a></h2>
453 <p>There are two ways to evaluate genotype information within the browser, from an accession detail page or a trial detail page.</p>
454 <div id="browsing-genotype-data-by-accession" class="section level3 hasAnchor" number="25.3.1">
455 <h3><span class="header-section-number">25.3.1</span> Browsing Genotype data by Accession<a href="data-analysis-tools.html#browsing-genotype-data-by-accession" class="anchor-section" aria-label="Anchor link to header"></a></h3>
456 <p>If you are interested in browsing genotype information for a single accession, for example ‘BAHKYEHEMAA’, navigate to the accession detail page.</p>
457 <p><img src="assets/images/image152.png" width="50%" style="display: block; margin: auto;" /></p>
458 <p>Near the bottom of the detail page is a collapsible section called “Accession Jbrowse”.</p>
459 <p><img src="assets/images/image20.png" width="50%" style="display: block; margin: auto;" /></p>
460 <p>This section will contain a link to the accession jbrowse page if the necessary genotype data is available. Clicking the link should take you to a page that looks like this, a which point you can browsre the genotype data in the form of a vcf track aligned to the latest build of the genome.</p>
461 <p><img src="assets/images/image318.png" width="95%" style="display: block; margin: auto;" /></p>
462 </div>
463 <div id="browsing-genotype-data-by-trial" class="section level3 hasAnchor" number="25.3.2">
464 <h3><span class="header-section-number">25.3.2</span> Browsing Genotype data by Trial<a href="data-analysis-tools.html#browsing-genotype-data-by-trial" class="anchor-section" aria-label="Anchor link to header"></a></h3>
465 <p>If you are interested in browsing genotype information for the accessions within a given trial, navigate to the trial detail page.</p>
466 <p><img src="assets/images/image277.png" width="50%" style="display: block; margin: auto;" /></p>
467 <p>Halfway down the page is a collapsible section called “Trial Jbrowse”. This section will contain a link to the trial jbrowse page if the necessary genotype data for at least two accessions planted in the trial is available.</p>
468 <p><img src="assets/images/image268.png" width="50%" style="display: block; margin: auto;" /></p>
469 <p>Clicking the link should take you to a page that looks like this, a which point you can browse the genotype data in the form of vcf tracks aligned to the latest build of the genome.</p>
470 <p><img src="assets/images/image327.png" width="95%" style="display: block; margin: auto;" /></p>
471 </div>
472 </div>
473 <div id="principal-component-analysis-pca" class="section level2 hasAnchor" number="25.4">
474 <h2><span class="header-section-number">25.4</span> Principal Component Analysis (PCA)<a href="data-analysis-tools.html#principal-component-analysis-pca" class="anchor-section" aria-label="Anchor link to header"></a></h2>
475 <p>Principal component analysis helps estimate and visualize if there is sub-grouping of individuals within a dataset based on a number of variables. Currently, you can use marker data to run PCA on datasets.</p>
476 <p>You can run PCA from multiple places on the website. To do PCA on</p>
477 <ol style="list-style-type: decimal">
478 <li><p>individuals from a trial, go to the trial detail page and find the PCA tool under the ‘Analysis tools’ section.</p></li>
479 <li><p>individuals from a training population you used in a GS modeling, do your modeling and find the PCA tool in the model output page.</p></li>
480 <li><p>individuals in a training population and selection population you applied the training model, do your modeling, apply the model on the selection population and find the PCA tool on the selection population prediction output page.</p></li>
481 <li><p>individuals in a list of accessions you created, for example using the search wizard, go to the ‘Analyze’ menu and select the ‘Population Structure’, select your list of individuals and run PCA.</p></li>
482 <li><p>individuals from multiple trials, create a list of the trials using the search wizard, go to the ‘Analyze’ menu and select the ‘Population Structure’, select your list of trials and run PCA.</p></li>
483 </ol>
484 <p><img src="assets/images/pca_iita_naccri_trials.png" width="95%" style="display: block; margin: auto;" /></p>
485 <p>With all the options, you will get a interactive plot of the two PCs (shown below) that explain the largest variance. Point the cursor at any data point and you will see the individual name with its corresponding PCs scores. By clicking the ‘Download all PCs’, you can also download the 10 PCs scores in the text format.</p>
486 </div>
487 <div id="anova" class="section level2 hasAnchor" number="25.5">
488 <h2><span class="header-section-number">25.5</span> ANOVA<a href="data-analysis-tools.html#anova" class="anchor-section" aria-label="Anchor link to header"></a></h2>
489 <p>Currently, ANOVA is implemented for a single trial (single year and single location). You can do ANOVA for RCBD, CRD, Alpha and Augmented trial designs. ANOVA is done using linear mixed effects model, where the genotypes is fixed effect and the replications and blocks are random effects. Fixed effect significance level is computed using ‘lmer’ from ‘lmeTest’ R package.</p>
490 <p>You can do ANOVA from two places: trial detail and training population detail. In both cases, if the phenotype data was from the supported trial designs,</p>
491 <p>– Go to the ANOVA section down in the trial or training population page</p>
492 <p>– Select the trait of you want to perform ANOVA</p>
493 <p>– Click the ‘Run ANOVA’ and wait for the result</p>
494 <p><img src="assets/images/anova-dm.png" width="95%" style="display: block; margin: auto;" /></p>
495 </div>
496 <div id="clustering-k-means-hierarchical" class="section level2 hasAnchor" number="25.6">
497 <h2><span class="header-section-number">25.6</span> Clustering (K-Means, Hierarchical)<a href="data-analysis-tools.html#clustering-k-means-hierarchical" class="anchor-section" aria-label="Anchor link to header"></a></h2>
498 <p>The K-Means method allows you to partition a dataset into groups (K number). The hierarchical clustering, agglomerative, allows you to explore underlying similarity and visualize in a tree structure (dendrogram) the different levels of similarities (clusters) among samples. You can do clustering based on marker data, phenotype data and GEBVs. When you use phenotype data, first clone averages for each trait are calculated. Both methods use Euclidean distance as a measure of similarity. For the hierachical clustering, the complete-linkage (farthest neighbour) method is used to link up clusters.</p>
499 <p>There are three pathways to using this tool.</p>
500 <ol style="list-style-type: decimal">
501 <li>When you have data in the form of a list or dataset from the search wizard:</li>
502 </ol>
503 <ol style="list-style-type: upper-alpha">
504 <li><p>– go to the ‘Analyze’ menu and select the clustering option</p></li>
505 <li><p>– make sure you are logged in</p></li>
506 <li><p>– Select the relevant genotyping protocol, if you are clustering using genotype data</p></li>
507 <li><p>– select your list or dataset, click ‘Go’</p></li>
508 <li><p>– select clustering type</p></li>
509 <li><p>– select the data type to use</p></li>
510 <li><p>– If you are running K-Means clustering, provide the number of partitions (K). If left blank it will partition the data set into optimal numbers for the dataset.</p></li>
511 <li><p>– click the ‘Run Cluster’ and wait for the analysis to finish or queue the request and wait for an email with the analysis result.</p></li>
512 <li><p>– You can download the outputs following the download links.</p></li>
513 </ol>
514 <ol start="2" style="list-style-type: decimal">
515 <li>From the trial detail page:</li>
516 </ol>
517 <ol style="list-style-type: upper-alpha">
518 <li><p>– Go to the ‘Analysis Tools’ section</p></li>
519 <li><p>– Follow steps D to G in (1)</p></li>
520 </ol>
521 <ol start="3" style="list-style-type: decimal">
522 <li>In the solGS pipeline:</li>
523 </ol>
524 <ol style="list-style-type: upper-alpha">
525 <li>– Once you you are in a model output put page, you will see a section where you can do clustering in the same way as above (option 2).</li>
526 </ol>
527 <p>K-Means clustering:</p>
528 <p><img src="assets/images/k-means-cluster.png" width="95%" style="display: block; margin: auto;" /></p>
529 <p>Hierarchical clustering:</p>
530 <p><img src="assets/images/hclustering.png" width="95%" style="display: block; margin: auto;" /></p>
531 </div>
532 <div id="genetic-gain" class="section level2 hasAnchor" number="25.7">
533 <h2><span class="header-section-number">25.7</span> Genetic Gain<a href="data-analysis-tools.html#genetic-gain" class="anchor-section" aria-label="Anchor link to header"></a></h2>
534 <p>You can check for genetic gain by comparing the the GEBVs of a training and a selection population. You can do this in the solGS pipepline once you build a model and apply the model to predict the GEBVs of a selection population. Once at that stage, you will see a section ‘Check Genetic Gain’. Select a selection population to compare with the training population and click the ‘Check Genetic Gain’ button. The genetic gain will be visualized in boxplots. You can download the boxplot(s) as well as the GEBVs data used for the plot(s).</p>
535 <p><img src="assets/images/genetic-gain.png" width="95%" style="display: block; margin: auto;" /></p>
536 </div>
537 <div id="kinship-and-inbreeding-coefficients" class="section level2 hasAnchor" number="25.8">
538 <h2><span class="header-section-number">25.8</span> Kinship and Inbreeding Coefficients<a href="data-analysis-tools.html#kinship-and-inbreeding-coefficients" class="anchor-section" aria-label="Anchor link to header"></a></h2>
539 <p>This tool allows you to estimate genetic relatedness between a pair of individuals (kinship), homozygousity across loci in an individual (inbreeding coefficient), and genetic similarity of an individual relative to the rest of the population (averge kinship).</p>
540 <p>There are three pathways to using this tool.</p>
541 <p><strong>(1)</strong> When you have a list or dataset clones, created from the search wizard:</p>
542 <ol style="list-style-type: upper-alpha">
543 <li><p>– go to the ‘Analyze’ menu and select the kinship and inbreeding</p></li>
544 <li><p>– make sure you are logged in</p></li>
545 <li><p>– Select the genotypic protocol for the marker data</p></li>
546 <li><p>– select your list or dataset of clones, click ‘Go’</p></li>
547 <li><p>– click the ‘Run Kinship’ and wait for the analysis to finish, depending on the data size this may take minutes. You can choose to submit the analysis and wait for an email notice to view the results or wait for it to complete.</p></li>
548 <li><p>– You can download the output following the download links.</p></li>
549 </ol>
550 <p><strong>(2)</strong> From the trial detail page:</p>
551 <ol style="list-style-type: upper-alpha">
552 <li><p>– Go to the ‘Analysis Tools’ section</p></li>
553 <li><p>– Follow steps C to G in (1)</p></li>
554 </ol>
555 <p><strong>(3)</strong> In the solGS pipeline:</p>
556 <ol style="list-style-type: upper-alpha">
557 <li>– Once you you are in a model output put page, scroll down to the ‘Kinship and Inbreeding’ section and run kinship.</li>
558 </ol>
559 <p><img src="assets/images/kinship-inbreeding.png" width="95%" style="display: block; margin: auto;" /></p>
560 </div>
561 <div id="creating-crossing-groups" class="section level2 hasAnchor" number="25.9">
562 <h2><span class="header-section-number">25.9</span> Creating Crossing Groups<a href="data-analysis-tools.html#creating-crossing-groups" class="anchor-section" aria-label="Anchor link to header"></a></h2>
563 <p>If you calculate selection index based on GEBVs of multiple traits, and you want to select a certain proportion of the indexed individuals (e.g. top 10%, or bottom 10%) and then you want to partition the selected individuals into a number of groups based on their genotypes, you can use the k-means clustering method.</p>
564 <p>The procedure is:</p>
565 <ol style="list-style-type: decimal">
566 <li><p>predict GEBVs for 2 or more traits</p></li>
567 <li><p>In the models output page, calculate selection indices. Note the name of the selection index data.</p></li>
568 <li><p>Go to the clustering section,</p></li>
569 </ol>
570 <p>– select the selection index data,</p>
571 <p>– select ‘K-means’,</p>
572 <p>– select ‘Genotype’,</p>
573 <p>– in the K-numbers textbox, fill in the number of groups you want to create,</p>
574 <p>– in the selection proportion textbox, fill in the proportion of the indexed individuals you want to select, e.g. for the top 15 percent, 15. if you wish to select bottom performing, prefix the number with minus sign (e.g. -15)</p>
575 <p>– then run cluster and wait for the result.</p>
576 <p><img src="assets/images/selection_proportion_clustering.png" width="95%" style="display: block; margin: auto;" /></p>
577 </div>
578 <div id="search-wizard-genomic-relationship-matrix-grm-download" class="section level2 hasAnchor" number="25.10">
579 <h2><span class="header-section-number">25.10</span> Search Wizard Genomic Relationship Matrix (GRM) Download<a href="data-analysis-tools.html#search-wizard-genomic-relationship-matrix-grm-download" class="anchor-section" aria-label="Anchor link to header"></a></h2>
580 <p>The genomic relationship matrix (GRM) is useful for understanding underlying structure in your population. Breedbase can compute the GRM using rrBLUP. First, select accessions in the search wizard and optionally select a genotyping protocol. If no genotyping protocol is selected, the default genotyping protocol in your system is used (as defined in sgn_local.conf). Specify the minor allele frequency, missing marker data, and missing individuals data filters to apply. The GRM can be returned in a matrix format (.tsv) which shows all pairwise relationships between the selected accessions and is useful for visualization; alternatively, the GRM can be returned in a three-column format (.tsv) which is useful for programs like ASReml outside of Breedbase. The GRM can also be returned as a simple correlation heatmap image (.pdf). The GRM can be computed from parents of the selected accessions granted the parents were genotyped, by clicking the checkbox “compute from parents”; this is useful for programs where parental lines are genotyped and then hybrids are created and evaluated in the field.</p>
581 <p><img src="assets/images/search_wizard_genotype_analyses_grm.png" width="95%" style="display: block; margin: auto;" /></p>
582 </div>
583 <div id="search-wizard-genome-wide-association-study-gwas" class="section level2 hasAnchor" number="25.11">
584 <h2><span class="header-section-number">25.11</span> Search Wizard Genome Wide Association Study (GWAS)<a href="data-analysis-tools.html#search-wizard-genome-wide-association-study-gwas" class="anchor-section" aria-label="Anchor link to header"></a></h2>
585 <p>Performing a genome wide association study (GWAS) can determine genotypic markers which are significantly correlated to phenotypic traits. Breedbase can compute GWAS using rrBLUP. First, select accessions and trait(s) in the search wizard, and optionally select a genotyping protocol. If no genotyping protocol is selected, the default genotyping protocol in your system is used (as defined in sgn_local.conf). Several traits can be selected in the search wizard; if the traits are not to be treated as repeated measurements then select ‘no’ in the select box and this will tell Breedbase to return GWAS results independently for the selected traits. If the selected traits are indeed all repeated measurements then select ‘yes’ in the select box and Breedbase will return as single GWAS analysis across all the phenotypic records. Specify the minor allele frequency, missing marker data, and missing individuals data filters to apply. GWAS results can be returned in a tabular format (.tsv) where the -log10(p-values) for the selected traits are returned; alternatively, the GWAS results can be returned as Manhattan and QQ plots for the selected traits. The GWAS can be computed from parents of the selected accessions granted the parents were genotyped, by clicking the checkbox “compute from parents”; this is useful for programs where parental lines are genotyped and then hybrids are created and evaluated in the field.</p>
586 <p>The GWAS will filter the data by the input MAF and missing data filters provided. After filtering the data is imputed using an ‘EM’ method in rrBLUP. The Kinship matrix (GRM) is computed from the imputed genotypic data and used in the GWAS model. The GWAS uses fixed effects for different field trials and replicates in the phenotypic data.</p>
587 <p><img src="assets/images/search_wizard_genotype_analyses_gwas.png" width="95%" style="display: block; margin: auto;" /></p>
588 <p><img src="assets/images/search_wizard_genotype_analyses_manhattan_plot.png" width="95%" style="display: block; margin: auto;" /></p>
589 <p><img src="assets/images/search_wizard_genotype_analyses_qq_plot.png" width="95%" style="display: block; margin: auto;" /></p>
590 </div>
591 <div id="spectral-analysis" class="section level2 hasAnchor" number="25.12">
592 <h2><span class="header-section-number">25.12</span> Spectral Analysis<a href="data-analysis-tools.html#spectral-analysis" class="anchor-section" aria-label="Anchor link to header"></a></h2>
593 <p>Visible and near-infrared spectroscopy (vis-NIRS) can be related to reference phenotypes through statistical models to produce accurate phenotypic predictions for unobserved samples, increasing phenotyping throughput. This technique is commonly used for predicting traits such as total starch, protein, carotenoid, and water content in many plant breeding programs. Breedbase implements the R package <a href="https://CRAN.R-project.org/package=waves"><em>waves</em></a> to offer training, evaluation, storage, and use of vis-NIRS prediction models for a wide range of spectrometers and phenotypes.</p>
594 <p><img src="assets/images/waves_breedbase_schema.png" width="95%" style="display: block; margin: auto;" /></p>
595 <div id="dataset-selection" class="section level3 hasAnchor" number="25.12.1">
596 <h3><span class="header-section-number">25.12.1</span> Dataset selection<a href="data-analysis-tools.html#dataset-selection" class="anchor-section" aria-label="Anchor link to header"></a></h3>
597 <p>In order to initiate an analysis, the user must select one or more datasets using <a href="searching-the-database.html#search-wizard">2.1</a>. A dataset in Breedbase can contain observationUnit-level (plot-, plant-, or sample-level) trial metadata and phenotypic data from one or more trials. After navigating to the “NIRS” webpage under the “Manage” tab in Breedbase, the user can initiate an analysis and select one of these datasets as input for model training. An optional test dataset can be selected in the second step of the workflow.</p>
598 <p><img src="assets/images/manage_NIRS_prediction_workflow_intro.png" width="95%" style="display: block; margin: auto;" /></p>
599 <p><img src="assets/images/manage_NIRS_prediction_workflow_dataset.png" width="95%" style="display: block; margin: auto;" /></p>
600 </div>
601 <div id="cross-validation" class="section level3 hasAnchor" number="25.12.2">
602 <h3><span class="header-section-number">25.12.2</span> Cross-validation<a href="data-analysis-tools.html#cross-validation" class="anchor-section" aria-label="Anchor link to header"></a></h3>
603 <p>Five cross-validation schemes that represent scenarios common in plant breeding are available for this analysis. These include CV1, CV2, CV0, and CV00 as outlined below and described in depth by Jarquín et al. (2017) as well as random and stratified random sampling with a 70% training and 30% validation split. For those schemes from Jarquín et al. (2017), specific input datasets must be chosen based on genotype and environment relatedness. Cross-validation choices:
604 * <strong>Random sampling</strong> (70% training / 30% validation)
605 * <strong>Stratified random sampling</strong>, stratified based on phenotype (70% training / 30% validation)
606 * <strong>CV1</strong>, untested lines in tested environments
607 * <strong>CV2</strong>, tested lines in tested environments
608 * <strong>CV0</strong>, tested lines in untested environments
609 * <strong>CV00</strong>, untested lines in untested environments</p>
610 <p><img src="assets/images/manage_NIRS_cv.png" width="95%" style="display: block; margin: auto;" /></p>
611 </div>
612 <div id="preprocessing" class="section level3 hasAnchor" number="25.12.3">
613 <h3><span class="header-section-number">25.12.3</span> Preprocessing<a href="data-analysis-tools.html#preprocessing" class="anchor-section" aria-label="Anchor link to header"></a></h3>
614 <p>Preprocessing, also known as pretreatment, is often used to increase the signal to noise ratio in vis-NIR datasets. The <em>waves</em> function <em>DoPreprocessing()</em> applies functions from the <em>stats</em> and <em>prospectr</em> packages for common spectral preprocessing methods with the following options:
615 * Raw data (default)
616 * First derivative
617 * Second derivative
618 * Gap segment derivative
619 * Standard normal variate (SNV; Barnes et al., 1989)
620 * Savitzky-Golay polynomial smoothing (Savitzky and Golay, 1964)</p>
621 <p>For more information on preprocessing methods and implementation, see the <a href="https://CRAN.R-project.org/package=waves"><em>waves</em></a> manual, available through CRAN: <a href="https://cran.r-project.org/web/packages/waves/waves.pdf">waves.pdf</a></p>
622 <p><img src="assets/images/manage_NIRS_snv.png" width="95%" style="display: block; margin: auto;" /></p>
623 </div>
624 <div id="algorithms" class="section level3 hasAnchor" number="25.12.4">
625 <h3><span class="header-section-number">25.12.4</span> Algorithms<a href="data-analysis-tools.html#algorithms" class="anchor-section" aria-label="Anchor link to header"></a></h3>
626 <p>Several algorithms are available for calibration model development in Breedbase via the <a href="https://CRAN.R-project.org/package=waves"><em>waves</em></a> package. The <em>TrainSpectralModel()</em> function in waves performs hyperparameter tuning as applicable using these algorithms in combination with cross validation and train functions from the package <em>caret</em>. Currently, only regression algorithms are available, but classification algorithms such as PLS-DA and SVM clasification are under development.
627 * <strong>Partial least squares regression</strong> (PLSR; Wold et al., 1982; Wold et al., 1984) is a popular method for spectral calibrations, as it can handle datasets with high levels of collinearity, reducing the dimensionality of these data into orthogonal latent variables (components) that are then related to the response variable through a linear model (reviewed in Wold et al., 2001). To avoid overfitting, the number of these components included in the final model must be tuned for each use case. The PLSR algorithm from the <em>pls</em> package is implemented by waves.
628 * <strong>Random Forest regression</strong> (RF; Ho, 1995) is a machine learning algorithm based on a series of decision trees. The number of trees and decisions at each junction are hyperparameters that must be tuned for each model. Another feature of this algorithm is the ability to extract variable importance measures from a fitted model (Breiman, 2001). In Breedbase, this option is made available through implementation of the RF algorithm from the package randomForest in the waves function TrainSpectralModel(). This function outputs both model performance statistics and a downloadable table of importance values for each wavelength. It is worth noting that this algorithm is computationally intensive, so the user should not be alarmed if results do not come right away. Breedbase will continue to work in the background and will display results when the analysis is finished.
629 * <strong>Support vector machine regression</strong> (SVM; Vapnik, 2000) is another useful algorithm for working with high-dimension datasets consisting of non-linear data, with applications in both classification and regression. The package waves implements SVM with both linear and radial basis function kernels using the kernlab package.</p>
630 </div>
631 <div id="output-common-model-summary-statistics" class="section level3 hasAnchor" number="25.12.5">
632 <h3><span class="header-section-number">25.12.5</span> Output: common model summary statistics<a href="data-analysis-tools.html#output-common-model-summary-statistics" class="anchor-section" aria-label="Anchor link to header"></a></h3>
633 <p>After training, model performance statistics are both displayed on a results webpage and made available for download in .csv format. These statistics are calculated by the <em>TrainSpectralModel()</em> function in <a href="https://CRAN.R-project.org/package=waves"><em>waves</em></a> using the <em>caret</em> and <em>spectacles</em> packages. Reported statistics include:
634 * Tuned parameters depending on the model algoritm
635 * <strong>Best.n.comp</strong>, the best number of components to be included in a PLSR model
636 * <strong>Best.ntree</strong>, the best number of trees in an RF model
637 * <strong>Best.mtry</strong>, the best number of variables to include at every decision point in an RF model
638 * <strong>RMSECV</strong>, the root mean squared error of cross-validation
639 * <strong>R<sup>2</sup><sub>cv</sub></strong>, the coefficient of multiple determination of cross-validation for PLSR models
640 * <strong>RMSEP</strong>, the root mean squared error of prediction
641 * <strong>R<sup>2</sup><sub>p</sub></strong>, the squared Pearson’s correlation between predicted and observed test set values
642 * <strong>RPD</strong>, the ratio of standard deviation of observed test set values to RMSEP
643 * <strong>RPIQ</strong>, the ratio of performance to interquartile distance
644 * <strong>CCC</strong>, the concordance correlation coefficient
645 * <strong>Bias</strong>, the average difference between the predicted and observed values
646 * <strong>SEP</strong>, the standard error of prediction
647 * <strong>R<sup>2</sup><sub>sp</sub></strong>, the squared Spearman’s rank correlation between predicted and observed test set values</p>
648 </div>
649 <div id="export-model-for-later-use" class="section level3 hasAnchor" number="25.12.6">
650 <h3><span class="header-section-number">25.12.6</span> Export model for later use<a href="data-analysis-tools.html#export-model-for-later-use" class="anchor-section" aria-label="Anchor link to header"></a></h3>
651 <p>Once a model has been trained, it can be stored for later use. This action calls the <em>SaveModel()</em> function from <a href="https://CRAN.R-project.org/package=waves"><em>waves</em></a>. Metadata regarding the training dataset and other parameters specified by the user upon training initialization are stored alongside the model object itself in the database.</p>
652 <p><img src="assets/images/manage_NIRS_export_model.png" width="95%" style="display: block; margin: auto;" /></p>
653 </div>
654 <div id="predict-phenotypes-from-an-exported-model-routine-use" class="section level3 hasAnchor" number="25.12.7">
655 <h3><span class="header-section-number">25.12.7</span> Predict phenotypes from an exported model (routine use)<a href="data-analysis-tools.html#predict-phenotypes-from-an-exported-model-routine-use" class="anchor-section" aria-label="Anchor link to header"></a></h3>
656 <p>For phenotype predictions, users select a dataset and can then choose from models in the database that were trained using the same spectrometer type as the spectral data in the chosen dataset. Predicted phenotypes are stored as such in the database and are tagged with an ontology term specifying that they are predicted and not directly measured. Metadata regarding the model used for prediction is stored alongside the predicted value in the database. Predicted phenotypes can then be used as normal in other Breedbase analysis tools such as the Selection Index and GWAS.</p>
657 <p><img src="assets/images/manage_NIRS_select_model.png" width="95%" style="display: block; margin: auto;" /></p>
658 <p><img src="assets/images/manage_NIRS_prediction_results.png" width="95%" style="display: block; margin: auto;" /></p>
659 </div>
660 <div id="faq" class="section level3 hasAnchor" number="25.12.8">
661 <h3><span class="header-section-number">25.12.8</span> FAQ<a href="data-analysis-tools.html#faq" class="anchor-section" aria-label="Anchor link to header"></a></h3>
662 <p>The Breedbase Spectral Analysis Tool does not allow for prediction models involving data from multiple spectrometer types at once.</p>
663 <p>References
664 * Barnes, R.J., M.S. Dhanoa, and S.J. Lister. 1989. Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra. Appl. Spectrosc. 43(5): 772-777. doi: 10.1366/0003702894202201.
665 * Breiman, L. 2001. Random forests. Mach. Learn. 45: 5-32. doi: 10.1201/9780429469275-8.
666 * Ho, T.K. 1995. Random decision forests. Proc. Int. Conf. Doc. Anal. Recognition, ICDAR 1: 278-282. doi: 10.1109/ICDAR.1995.598994.
667 * Jarquín, D., C. Lemes da Silva, R.C. Gaynor, J. Poland, A. Fritz, et al. 2017. Increasing Genomic-Enabled Prediction Accuracy by Modeling Genotype x Environment Interactions in Kansas Wheat. Plant Genome 10(2): plantgenome2016.12.0130. doi: 10.3835/plantgenome2016.12.0130.
668 * Johnson, R.A., and D.W. Wichern. 2007. Applied Multivariate Statistical Analysis (6th Edition).
669 De Maesschalck, R., D. Jouan-Rimbaud, and D.L. Massart. 2000. The Mahalanobis distance. Chemom. Intell. Lab. Syst. 50(1): 1-18. doi: 10.1016/S0169-7439(99)00047-7.
670 * Mahalanobis, P.C. 1936. On the generalized distance in statistics. Natl. Inst. Sci. India.
671 * Savitzky, A., and M.J.E. Golay. 1964. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 36(8): 1627-1639. doi: 10.1021/ac60214a047.
672 * Shrestha, R., L. Matteis, M. Skofic, A. Portugal, G. McLaren, et al. 2012. Bridging the phenotypic and genetic data useful for integrated breeding through a data annotation using the Crop Ontology developed by the crop communities of practice. Front. Physiol. 3 AUG(August): 1-10. doi: 10.3389/fphys.2012.00326.
673 * Vapnik, V.N. 2000. The Nature of Statistical Learning Theory. Springer New York, New York, NY.
674 * Wold, S., A. Ruhe, H. Wold, and W.J. Dunn, III. 1984. The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses. SIAM J. Sci. Stat. Comput. 5(3): 735-743. doi: 10.1137/0905052.
675 * Wold, S., M. Sjöström, and L. Eriksson. 2001. PLS-regression: a basic tool of chemometrics. Chemom. Intell. Lab. Syst. 58(2): 109-130. doi: 10.1016/S0169-7439(01)00155-1.</p>
676 </div>
677 </div>
678 <div id="general-mixed-model-tool" class="section level2 hasAnchor" number="25.13">
679 <h2><span class="header-section-number">25.13</span> General Mixed Model Tool<a href="data-analysis-tools.html#general-mixed-model-tool" class="anchor-section" aria-label="Anchor link to header"></a></h2>
680 <p>The general mixed model tool is available at <a href="/tools/mixedmodels">/tools/mixedmodels</a> and a link is provided from the Analyze menu.</p>
681 <p>To use the mixed model tool, first create dataset using the Wizard containing the data that you would like to analyze.</p>
682 <p>Select the Mixed Model tool from the Analyze menu.</p>
683 <p>You are presented with a workflow. On the first step of the workflow, select the dataset that you wish to analyze, click on “Choose dataset” to continue.</p>
684 <p>The second part of the workflow presents you with the traits in the dataset; you can select one or more traits from the lists using the select buttons. If you selected one trait, a bargraph of the trait distribution will be shown. Click the “Next step” button to move to the next screen.</p>
685 <p><img src="assets/images/mixedmodel_tool_model_build_step.png" width="95%" style="display: block; margin: auto;" /></p>
686 <p>On the model build screen, all the factors are displayed that are contained within the dataset. The factors are presented as a list of blue buttons that can be dragged using the mouse to areas on the screen which build a mixed model equation. The areas correspond to fixed factors, random factors, and optionally to more complex factors, such as fixed factors with interaction and fixe factors with vriable slope/intersects. Drag the available factors to the corresponding area. To calculate BLUPs for germplasm, drag the germplasmName button to the “Random factors” area. To calculate BLUEs, drag it to the “Fixed factors” area. The factors need to have different levels contained within them, for example, if there is only one trial in the dataset, it cannot be used as one of the factors. Click on “Run analysis and got to next step” to run the mixed model and display the results.</p>
687 <p>The result view contains two tabs, one with the raw data, either BLUPS or BLUEs, and the other the adjusted means from the raw data.</p>
688 <p>The results can be stored in the database as an analysis, by clicking the button provided on the top of the data.</p>
689 </div>
690 <div id="genomic-prediction-of-cross-performance-gcpc" class="section level2 hasAnchor" number="25.14">
691 <h2><span class="header-section-number">25.14</span> Genomic Prediction of Cross Performance (GCPC)<a href="data-analysis-tools.html#genomic-prediction-of-cross-performance-gcpc" class="anchor-section" aria-label="Anchor link to header"></a></h2>
692 <p>The GCPC tool is available at <a href="/tools/gcpc">/tools/gcpc</a> and a link is provided from the Analyze menu.
693 The GCPC tool implements genomic prediction with additive and directional dominance in the linear mixed model to predict for cross performance.</p>
694 <p>Before using the tool, first create a dataset using the Wizard containing the data that you would like to analyze. (The dataset should have genotyping_protocols).
695 Second, create Selection Indices for your traits using Selection Index in Analyze Menu.</p>
696 <p>To use the tool, Select the GCPC tool from the Analyze menu.</p>
697 <p>Then, select the dataset with genotyping_protocols that you wish to analyze, click on “Proceed to Factor Selection” to load available factors that can be included in the model.</p>
698 <p>Select the factors you wish to include in the model either as Fixed or Random. Click “None” for factors that you don’t want to include in the model. Note that the “germplasmName” is factored as Random by default.</p>
699 <p>The next step is to select the selection index for your traits on the dropdown menu.</p>
700 <p>Once you are through, click “Run GCPC” to run the model. The output will be presented in form of a table with “ID”, “Parent1”, “Parent2” and their cross prediction merit organized in descending order.
701 The results will also have sex information based on whether the dataset has plant sexes available in the database.</p>
703 </div>
704 </div>
705 </section>
707 </div>
708 </div>
709 </div>
710 <a href="managing-outliers-in-dataset.html" class="navigation navigation-prev navigation-unique" aria-label="Previous page"><i class="fa fa-angle-left"></i></a>
712 </div>
713 </div>
714 <script src="libs/gitbook-2.6.7/js/app.min.js"></script>
715 <script src="libs/gitbook-2.6.7/js/clipboard.min.js"></script>
716 <script src="libs/gitbook-2.6.7/js/plugin-search.js"></script>
717 <script src="libs/gitbook-2.6.7/js/plugin-sharing.js"></script>
718 <script src="libs/gitbook-2.6.7/js/plugin-fontsettings.js"></script>
719 <script src="libs/gitbook-2.6.7/js/plugin-bookdown.js"></script>
720 <script src="libs/gitbook-2.6.7/js/jquery.highlight.js"></script>
721 <script src="libs/gitbook-2.6.7/js/plugin-clipboard.js"></script>
722 <script>
723 gitbook.require(["gitbook"], function(gitbook) {
724 gitbook.start({
725 "sharing": false,
726 "fontsettings": {
727 "theme": "white",
728 "family": "sans",
729 "size": 2
731 "edit": {
732 "link": null,
733 "text": null
735 "history": {
736 "link": null,
737 "text": null
739 "view": {
740 "link": null,
741 "text": null
743 "download": ["BreedbaseManual.pdf"],
744 "search": {
745 "engine": "fuse",
746 "options": null
748 "toc": {
749 "collapse": "section"
751 "info": false
754 </script>
756 <!-- dynamically load mathjax for compatibility with self-contained -->
757 <script>
758 (function () {
759 var script = document.createElement("script");
760 script.type = "text/javascript";
761 var src = "true";
762 if (src === "" || src === "true") src = "https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.9/latest.js?config=TeX-MML-AM_CHTML";
763 if (location.protocol !== "file:")
764 if (/^https?:/.test(src))
765 src = src.replace(/^https?:/, '');
766 script.src = src;
767 document.getElementsByTagName("head")[0].appendChild(script);
768 })();
769 </script>
770 </body>
772 </html>