2 * Library for simple URI parsing and manipulation.
4 * Intended to be minimal, but featureful; do not expect full RFC 3986 compliance. The use cases we
5 * have in mind are constructing 'next page' or 'previous page' URLs, detecting whether we need to
6 * use cross-domain proxies for an API, constructing simple URL-based API calls, etc. Parsing here
7 * is regex-based, so may not work on all URIs, but is good enough for most.
9 * You can modify the properties directly, then use the #toString method to extract the full URI
10 * string again. Example:
12 * var uri = new mw.Uri( 'http://example.com/mysite/mypage.php?quux=2' );
14 * if ( uri.host == 'example.com' ) {
15 * uri.host = 'foo.example.com';
16 * uri.extend( { bar: 1 } );
18 * $( 'a#id1' ).attr( 'href', uri );
19 * // anchor with id 'id1' now links to http://foo.example.com/mysite/mypage.php?bar=1&quux=2
21 * $( 'a#id2' ).attr( 'href', uri.clone().extend( { bar: 3, pif: 'paf' } ) );
22 * // anchor with id 'id2' now links to http://foo.example.com/mysite/mypage.php?bar=3&quux=2&pif=paf
26 * `http://usr:pwd@www.example.com:81/dir/dir.2/index.htm?q1=0&&test1&test2=&test3=value+%28escaped%29&r=1&r=2#top`
27 * the returned object will have the following properties:
32 * host 'www.example.com'
34 * path '/dir/dir.2/index.htm'
39 * test3: 'value (escaped)'
44 * (N.b., 'password' is technically not allowed for HTTP URIs, but it is possible with other kinds
47 * Parsing based on parseUri 1.2.2 (c) Steven Levithan <http://stevenlevithan.com>, MIT License.
48 * <http://stevenlevithan.com/demo/parseuri/js/>
53 ( function ( mw, $ ) {
55 * Function that's useful when constructing the URI string -- we frequently encounter the pattern
56 * of having to add something to the URI as we go, but only if it's present, and to include a
57 * character before or after if so.
61 * @param {string|undefined} pre To prepend
62 * @param {string} val To include
63 * @param {string} post To append
64 * @param {boolean} raw If true, val will not be encoded
65 * @return {string} Result
67 function cat( pre, val, post, raw ) {
68 if ( val === undefined || val === null || val === '' ) {
71 return pre + ( raw ? val : mw.Uri.encode( val ) ) + post;
75 * Regular expressions to parse many common URIs.
79 * @property {Object} parser
82 strict: /^(?:([^:\/?#]+):)?(?:\/\/(?:(?:([^:@\/?#]*)(?::([^:@\/?#]*))?)?@)?([^:\/?#]*)(?::(\d*))?)?((?:[^?#\/]*\/)*[^?#]*)(?:\?([^#]*))?(?:#(.*))?/,
83 loose: /^(?:(?![^:@]+:[^:@\/]*@)([^:\/?#.]+):)?(?:\/\/)?(?:(?:([^:@\/?#]*)(?::([^:@\/?#]*))?)?@)?([^:\/?#]*)(?::(\d*))?((?:\/(?:[^?#](?![^?#\/]*\.[^?#\/.]+(?:[?#]|$)))*\/?)?[^?#\/]*)(?:\?([^#]*))?(?:#(.*))?/
87 * The order here matches the order of captured matches in the `parser` property regexes.
91 * @property {Array} properties
105 * @property {string} protocol For example `http` (always present)
108 * @property {string|undefined} user For example `usr`
111 * @property {string|undefined} password For example `pwd`
114 * @property {string} host For example `www.example.com` (always present)
117 * @property {string|undefined} port For example `81`
120 * @property {string} path For example `/dir/dir.2/index.htm` (always present)
123 * @property {Object} query For example `{ a: '0', b: '', c: 'value' }` (always present)
126 * @property {string|undefined} fragment For example `top`
130 * A factory method to create a variation of mw.Uri with a different default location (for
131 * relative URLs, including protocol-relative URLs). Used so the library is still testable &
137 mw.UriRelative = function ( documentLocation ) {
144 * Construct a new URI object. Throws error if arguments are illegal/impossible, or
145 * otherwise don't parse.
147 * @param {Object|string} [uri] URI string, or an Object with appropriate properties (especially
148 * another URI object to clone). Object must have non-blank `protocol`, `host`, and `path`
149 * properties. If omitted (or set to `undefined`, `null` or empty string), then an object
150 * will be created for the default `uri` of this constructor (`document.location` for
151 * mw.Uri, other values for other instances -- see mw.UriRelative for details).
152 * @param {Object|boolean} [options] Object with options, or (backwards compatibility) a boolean
154 * @param {boolean} [options.strictMode=false] Trigger strict mode parsing of the url.
155 * @param {boolean} [options.overrideKeys=false] Whether to let duplicate query parameters
156 * override each other (`true`) or automagically convert them to an array (`false`).
158 function Uri( uri, options ) {
159 options = typeof options === 'object' ? options : { strictMode: !!options };
160 options = $.extend( {
165 if ( uri !== undefined && uri !== null && uri !== '' ) {
166 if ( typeof uri === 'string' ) {
167 this.parse( uri, options );
168 } else if ( typeof uri === 'object' ) {
169 // Copy data over from existing URI object
170 for ( var prop in uri ) {
171 // Only copy direct properties, not inherited ones
172 if ( uri.hasOwnProperty( prop ) ) {
173 // Deep copy object properties
174 if ( $.isArray( uri[prop] ) || $.isPlainObject( uri[prop] ) ) {
175 this[prop] = $.extend( true, {}, uri[prop] );
177 this[prop] = uri[prop];
186 // If we didn't get a URI in the constructor, use the default one.
187 return defaultUri.clone();
190 // protocol-relative URLs
191 if ( !this.protocol ) {
192 this.protocol = defaultUri.protocol;
196 this.host = defaultUri.host;
199 this.port = defaultUri.port;
202 if ( this.path && this.path.charAt( 0 ) !== '/' ) {
203 // A real relative URL, relative to defaultUri.path. We can't really handle that since we cannot
204 // figure out whether the last path component of defaultUri.path is a directory or a file.
205 throw new Error( 'Bad constructor arguments' );
207 if ( !( this.protocol && this.host && this.path ) ) {
208 throw new Error( 'Bad constructor arguments' );
213 * Encode a value for inclusion in a url.
215 * Standard encodeURIComponent, with extra stuff to make all browsers work similarly and more
216 * compliant with RFC 3986. Similar to rawurlencode from PHP and our JS library
217 * mw.util.rawurlencode, except this also replaces spaces with `+`.
220 * @param {string} s String to encode
221 * @return {string} Encoded string for URI
223 Uri.encode = function ( s ) {
224 return encodeURIComponent( s )
225 .replace( /!/g, '%21' ).replace( /'/g, '%27' ).replace( /\(/g, '%28' )
226 .replace( /\)/g, '%29' ).replace( /\*/g, '%2A' )
227 .replace( /%20/g, '+' );
231 * Decode a url encoded value.
233 * Reversed #encode. Standard decodeURIComponent, with addition of replacing
237 * @param {string} s String to decode
238 * @return {string} Decoded string
240 Uri.decode = function ( s ) {
241 return decodeURIComponent( s.replace( /\+/g, '%20' ) );
247 * Parse a string and set our properties accordingly.
250 * @param {string} str URI, see constructor.
251 * @param {Object} options See constructor.
253 parse: function ( str, options ) {
257 // Apply parser regex and set all properties based on the result
258 matches = parser[ options.strictMode ? 'strict' : 'loose' ].exec( str );
259 $.each( properties, function ( i, property ) {
260 uri[ property ] = matches[ i + 1 ];
263 // uri.query starts out as the query string; we will parse it into key-val pairs then make
264 // that object the "query" property.
265 // we overwrite query in uri way to make cloning easier, it can use the same list of properties.
267 // using replace to iterate over a string
269 uri.query.replace( /(?:^|&)([^&=]*)(?:(=)([^&]*))?/g, function ( $0, $1, $2, $3 ) {
272 k = Uri.decode( $1 );
273 v = ( $2 === '' || $2 === undefined ) ? null : Uri.decode( $3 );
275 // If overrideKeys, always (re)set top level value.
276 // If not overrideKeys but this key wasn't set before, then we set it as well.
277 if ( options.overrideKeys || q[ k ] === undefined ) {
280 // Use arrays if overrideKeys is false and key was already seen before
282 // Once before, still a string, turn into an array
283 if ( typeof q[ k ] === 'string' ) {
287 if ( $.isArray( q[ k ] ) ) {
298 * Get user and password section of a URI.
302 getUserInfo: function () {
303 return cat( '', this.user, cat( ':', this.password, '' ) );
307 * Get host and port section of a URI.
311 getHostPort: function () {
312 return this.host + cat( ':', this.port, '' );
316 * Get the userInfo, host and port section of the URI.
318 * In most real-world URLs this is simply the hostname, but the definition of 'authority' section is more general.
322 getAuthority: function () {
323 return cat( '', this.getUserInfo(), '@' ) + this.getHostPort();
327 * Get the query arguments of the URL, encoded into a string.
329 * Does not preserve the original order of arguments passed in the URI. Does handle escaping.
333 getQueryString: function () {
335 $.each( this.query, function ( key, val ) {
336 var k = Uri.encode( key ),
337 vals = $.isArray( val ) ? val : [ val ];
338 $.each( vals, function ( i, v ) {
341 } else if ( k === 'title' ) {
342 args.push( k + '=' + mw.util.wikiUrlencode( v ) );
344 args.push( k + '=' + Uri.encode( v ) );
348 return args.join( '&' );
352 * Get everything after the authority section of the URI.
356 getRelativePath: function () {
357 return this.path + cat( '?', this.getQueryString(), '', true ) + cat( '#', this.fragment, '' );
361 * Get the entire URI string.
363 * May not be precisely the same as input due to order of query arguments.
365 * @return {string} The URI string
367 toString: function () {
368 return this.protocol + '://' + this.getAuthority() + this.getRelativePath();
374 * @return {Object} New URI object with same properties
377 return new Uri( this );
381 * Extend the query section of the URI with new parameters.
383 * @param {Object} parameters Query parameters to add to ours (or to override ours with) as an
385 * @return {Object} This URI object
387 extend: function ( parameters ) {
388 $.extend( this.query, parameters );
393 defaultUri = new Uri( documentLocation );
398 // If we are running in a browser, inject the current document location (for relative URLs).
399 if ( document && document.location && document.location.href ) {
400 mw.Uri = mw.UriRelative( document.location.href );
403 }( mediaWiki, jQuery ) );