2009-11-24: Release version 2.33

* 'mogadm fsck status' will now be more accurate.

* Support non-GNU df (paul armstrong)

* Remove old !replication command. It's in `mogadm stats` and tends
  to kill the parent process.

* Force refreshing internal monitoring status every 5 minutes. Helps
  work around a potential bug where state changes are missed.

* Use new delete queue for tempfile culling

* Calculate free space from available space, not total space

* FSCK code has new tunables to control how aggressively it runs:
  queue_size_for_fsck (how many fids to keep in file_to_queue table)
  queue_rate_for_fsck (how many fids to put into the table per second)
  the generic internal queue is now tunable:
  internal_queue_limit (how many fids to pull out of a queue table at
  once. old default was 500).
  All are tunable by 'mogadm settings set'

* Replication will now attempt to handle over-replication situations.
  FSCK will catch over-replication as a policy violation, and
  replicate will attempt to fix it. (dormando)
  (rarely it might need multiple fscks)

* Since FIDSizes is no longer used, remove it and remove gearman
  dependency. (dormando)

* FSCK no longer rabidly fills the queue table. (dormando)

* Make storage code resilient to rare deadlocks (dormando)

* Add 'max_handles' config option to restart a DB connection if there
  are too many handles to it (victori).
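
Illustration (not MogileFS code): the two FSCK tunables above can be
read as a cap and a rate. The Python sketch below assumes, as a
simplification, that once per second the queue table is topped up by
at most queue_rate_for_fsck fids without growing past
queue_size_for_fsck. The function name and the example numbers are
hypothetical.

```python
def fids_to_enqueue(current_depth, queue_size_for_fsck, queue_rate_for_fsck):
    """Return how many fids to add to the queue table this second."""
    # Room left before the table reaches its configured size cap.
    room = max(0, queue_size_for_fsck - current_depth)
    # Never add more than the per-second rate, even if there is room.
    return min(room, queue_rate_for_fsck)


if __name__ == "__main__":
    # Empty table: limited by the rate.
    print(fids_to_enqueue(0, 20000, 1000))
    # Nearly full table: limited by the remaining room.
    print(fids_to_enqueue(19500, 20000, 1000))
```

Lowering either value makes FSCK gentler on the database at the cost
of a slower scan.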
2009-10-18: Release version 2.32

* No changes... MFS is hard to release due to mogdeps :/

2009-10-18: Release version 2.31

* Fix create_close race condition. (pipegeek)

* Fix bug in reporting of available disk space.

* Many minor bug/documentation fixes.

* MySQL and Postgres installs now use BIGINTs for fid/length by
  default. (dormando, robbat2)

* Accept 405 responses to MKCOL from the WebDAV server since it often
  just means that the directory already exists. (Ask Bjørn Hansen)

* Fix showing domain and class in mogadm output when there's only the
  default class in the domain (Mark Imbriaco)

* mogstored: pidfile support

* New FSCK work and error for BCNT for bad devcount values (they would
  not otherwise be caught unless they triggered another fix).
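
Illustration (not MogileFS code): the MKCOL change above amounts to
treating one extra status code as success. A minimal Python sketch,
with a hypothetical function name:

```python
def mkcol_succeeded(status):
    """Decide whether a MKCOL response means the directory is usable."""
    # Any 2xx response means the collection was created.
    if 200 <= status < 300:
        return True
    # 405 (Method Not Allowed) often just means the directory already
    # exists, so it is accepted as well.
    return status == 405


if __name__ == "__main__":
    for code in (201, 405, 409, 500):
        print(code, mkcol_succeeded(code))
```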
2009-01-12: Release version 2.30

* fsck will now fix broken 'devcount' cols.

* schema version bump to 12, and new delete worker code to use the new
  queue system. delete should be faster and more scalable.

* add server setting queue_rate_for_fsck to allow rate limiting how
  fast the new distributed fsck creates jobs. the original default of
  10k could utterly defeat wimpy databases.

* Support 0 byte files.

* Fix a bug where rebalance would accidentally delete copies of a fid.

* Create new internal queue management system, with new job
  'JobMaster'. This greatly reduces database load and increases
  throughput for workers using the new system.

* Add 'no_unreachable_tracking' config. Disables more of the old
  replication code. Use this if you have no 1.x trackers.

* Add hook for alternate replication sources. Obscure, but can be

* Add replicate_order_final_choices hook, so multidc replication may
  pull from the local network if one copy already exists.

* Fix subtle bugs in process management.

* Allow overriding of default classes.
2008-08-18: Release version 2.20

* One more time, and hopefully never again. No changes. META.yml

2008-08-18: Release version 2.19

* No changes except for META.yml fixes.
  _should_ be able to upload releases without owning half of CPAN.
2008-08-16: Release version 2.18

* Add repl_use_get_port option (Brian Lynch)

* Remove tempfile data and rows on create_close (Chaos Wang)

* Don't delete files accidentally during tempfile table cleanup

* Specify username to run the mogstored daemon (Jonathan Share)

* Many spelling errors corrected.

* PostGres locking routines did not include the hostname for the lock

* RFC 2518 says we "should" use a trailing slash when calling
  MKCOL. Some servers (nginx) appear to require it. (Spotted by

* Add server option 'rebalance_ignore_missing' in mogilefsd config
  that allows the rebalance to ignore files that are already missing
  when it tries to delete them.

* Fix MultipleHosts replication policy to no longer return desperate
  devices after no longer being desperate, instead returning a state
  which will queue for retries in the future till it can be truly
  happy. This prevents files from being replicated to all writable
  devs when there are too few writable hosts to satisfy the policy.

* Add --chmod-mountpoints option to mogautomount, in an attempt to
  prevent mogstored machines from accidentally writing to their under-

* New optional argument 'startpos' to fsck_reset.

* Use correct per-Store port, defaulting to UNIX sockets on localhost.

* add --dbport option to mogdbsetup (Ask Bjoern Hansen)

* Fix state_change() bug that could make file_to_delete_later
  grow unnecessarily (Radu Greab)

* Add Mogstored::HTTPServer::None to allow using mogstored
  with unmanaged backend DAV servers.

* Add experimental cmd_edit_file. Client requires DAV MOVE and
  partial PUT on the store to use it.

* make mogstored and mogilefsd use the now-bundled dependency
  Perl modules as a last resort, unless MOGILE_NO_BUILTIN_DEPS
  environment variable is true. this makes it easier for people
  to use mogilefs, without dependency hell. distros/packagers
  are expected to ignore (or scoff at) this "feature". but
  easier for people working from tarball/svn releases. bundled
  dependencies will be updated over time, as needed.

* Bring the SQLite store up to date

* Increase the timeouts in the test suite

* In the test suite, also search for mogadm in /usr/local/bin/ and
  /usr/local/sbin (spotted by Dean Wilson)

* SCHEMA VERSION 10: 'replpolicy' column on 'class' table.
  it's safe to --ignore-schema-version and run this mogilefsd
  against an older schema. new column enables per-class
  pluggable replication policies. it was always
  abstract, but not easy to plug in your own alternatives.
  see doc/pluggable-replication-policies.txt

* add start of a MogileFS::ReplicationPolicy::Union implementation

* fix crash in queryworker's create_open command, when a device
  has its directory made on a mogile storage node (the host of
  which is being monitored by the 'monitor' process), but the
  device isn't registered in the devices table yet.
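
Illustration (not MogileFS code): "bundled dependencies as a last
resort" in the 2.18 notes can be pictured as appending the bundled
directory to the END of the module search path, so installed modules
always win. The Python sketch below is an analogy only. The function
name, the directory names and the rule for what counts as a "true"
value of MOGILE_NO_BUILTIN_DEPS are assumptions.

```python
def module_search_path(system_paths, bundled_dir, env):
    """Return the search path, with bundled modules tried last."""
    paths = list(system_paths)
    # Assumption: any value other than empty or "0" counts as true.
    disabled = env.get("MOGILE_NO_BUILTIN_DEPS", "") not in ("", "0")
    if not disabled:
        # Appending (not prepending) keeps the bundle a last resort.
        paths.append(bundled_dir)
    return paths


if __name__ == "__main__":
    print(module_search_path(["/usr/lib/perl5"], "lib/mogdeps", {}))
    print(module_search_path(["/usr/lib/perl5"], "lib/mogdeps",
                             {"MOGILE_NO_BUILTIN_DEPS": "1"}))
```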
2007-07-03: Release version 2.17

* get_paths command now takes a pathcount option, defaulting to
  2, instead of having a hardcoded constant of 2. This allows
  requesting more than 2 paths to a file.

* make fsck log over-replicated files as policy violations. it
  won't delete over-replicated files, though, just log

* Document how to test MogileFS under doc/testing.txt. You really
  should read this if you want to use MogileFS to store your data.

* Add a safety check to ensure that Postgresql-8.2 or newer is used.

* Update mogdbsetup usage, and ensure it chooses the correct admin
  username automatically.

* Document the --type option to mogdbsetup.

* Introduce the PostGres Store implementation. The locking system might
  still have issues, beware of stale locks! Initial prototype by Radu
  Greab <rg@yx.ro>, and developed heavily by Robin H. Johnson
  <robbat2@gentoo.org>.

* start of support to make mogstored wrap Apache + web_dav

* Remove duplicated [fsck] in log output of Fsck worker.

* Handle HiRes gettimeofday correctly to get useful results. Arrays are
  important. This gives us correct timing output.

* Handle input to log() correctly. Based on the implementation of
  Sys::Syslog::syslog(), produces no ugly warnings, and makes sure that
  printf stuff works nicely.

* The unix_timestamp function belongs to the Store, not other classes.
  Need to add test cases for process_deletes and stats.

* Using NULL during an INSERT for the AUTO_INCREMENT column does not
  work very well outside of MySQL. Pass the correct number of arguments
  to the INSERT instead.

* Clean up all raw usages of UNIX_TIMESTAMP() function, in preparation
  for database independence. If you need the unix timestamp, include the output
  of $self->unix_timestamp directly into your query.

* Alter $0 so that the scripts show up as '$SCRIPTNAME' in the process
  listing instead of '/usr/bin/perl $SCRIPTNAME'. This makes init.d
  scripts easier to manage.

* make mogstored's lighttpd config use mod_status for "/", so
  mogadm doesn't think it's dead when it previously saw a 404
  (Justin Huff <jjhuff@mspin.net>)

* remove Linux::AIO support, now that Perlbal has removed it as well,
  and because Linux::AIO support (when Perlbal then chooses
  IO::AIO) was hanging mogstored on shutdown. so also add a
  mogstored shutdown test (after a full daemonization of it)

* make Mgd::validate_dbh() re-check/ping slave dbhs as well.
  (problem mentioned by Steven Shou <steven.shou@verizon.net>)

* support drive identifiers above sdz (Arjan <info@bouncetek.com>)

* make mogstored's iostat child process properly shutdown,
  even when iostat binary isn't installed. (previously
  it was stuck in a loop every 10 seconds, hoping for
  iostat to be installed, but never respecting parent

* Change delete worker to use new DeviceStates interface.

* Fix get_paths to honor new device state of 'drain' via new
  DeviceStates interface.

* use significantly less CPU when re-broadcasting around
  iostat device utilization% to child processes. (future:
  don't rebroadcast at all and use shared memory)

* slightly faster fsck (do fewer syscalls)

* efficiency: don't broadcast dup state information to children
  (track in parent what children know already)

* if using Gearman::Server >= 1.09, respawn the fidsizes
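
Illustration (not MogileFS code): two 2.17 entries, the get_paths
pathcount option and get_paths honoring the 'drain' state, can be
pictured together. In the Python sketch below the trait names
should_put_new_files_on and should_drain_files_off come from this
changelog; the can_read_from trait, the 'alive' state name, the trait
values and the example URLs are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceState:
    name: str
    should_put_new_files_on: bool
    should_drain_files_off: bool
    can_read_from: bool  # hypothetical trait name, not from the changelog


# Trait values are assumptions based on the state descriptions in this file.
STATES = {
    "alive": DeviceState("alive", True, False, True),
    "readonly": DeviceState("readonly", False, False, True),
    "drain": DeviceState("drain", False, True, True),
    "dead": DeviceState("dead", False, True, False),
}


def get_paths(devices, pathcount=2):
    """devices: list of (path, state_name). Return up to pathcount readable paths."""
    # Ask the state a question instead of comparing state names directly.
    readable = [path for path, state in devices if STATES[state].can_read_from]
    return readable[:pathcount]


if __name__ == "__main__":
    devs = [("http://h1/dev1/f", "dead"), ("http://h2/dev2/f", "drain"),
            ("http://h3/dev3/f", "alive"), ("http://h4/dev4/f", "readonly")]
    print(get_paths(devs))
    print(get_paths(devs, pathcount=3))
```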
2007-05-09: Release version 2.16

* BUG FIX from 2.15: partially written but never closed
  tempfiles weren't being cleaned up properly.

* start to work better when compiled as a stand-alone
  PAR all-in-one script/archive
2007-05-07: Release version 2.15 ("fsck/drain/rebalance")

* minor bug fixes and enhancements for MultipleHosts replication
  policy. bunch of corner cases now checked with new tests.
  now easy to write more replication policy tests in future.

* replication policies can now return "desperate" requests,
  signalling that a replication reassessment should be
  enqueued for the future, to see if things could be improved.
  (that part's not currently implemented, but the real feature
  and motivation is that the rebalancer now won't delete a
  DevFID if it results in a desperate move, only an ideal move.)

* replication policies now can optionally return a new return
  value of the (new) type MogileFS::ReplicationRequest, which
  has pretty accessor names, can suggest multiple places,
  can indicate non-ideal emergency replication decisions.
  old plugins' return values will be transparently upgraded
  to the equivalent new return value objects.

* adding new device state: "drain". it's a hybrid of "dead"
  (in that files are migrated off it) and "readonly", in that
  it still serves traffic... it just doesn't get new files.
  this also introduces the new object-oriented DeviceState class,
  and device_state($name) utility function to get the DeviceState

* internal code cleanup. notably, kill the old & nasty legacy
  'find_deviceid' function which was ridiculously long and hairy.
  the two callers are now more readable with sorts/greps/etc.

* make mogstored's devN/usage writing process (DiskUsage) be less
  racy with the mogilefsd monitoring code... don't open file for
  write... open read/write, then in one write system call, write
  the entire file, with newline padding at end to cover old data,
  then truncate it if necessary. should remove harmless (but scary)
  error messages previously reported by the mogilefsd monitor
  about zero-length usage files.

* new protocol commands to list/set (certain) server settings,
  with value sanity checking (see MogileFS::Config for which
  are settable, and with what values). needed for "enable_rebalance".
  was partially enabled before for slave settings. also needed
  for memcached support before, which was never possible to
  set with mogadm, only with db tweaking.

* make mogilefsd fsck use new mogstored fid_sizes command, to
  do bulk stats. speeds up fscks a ton.

* be robust against system clocks that go backwards between
  http://lists.danga.com/pipermail/mogilefs/2007-April/000897.html

* Put gearman server in mogstored process, add worker
  'mogstored-fidsizes' which runs as subprocess of mogstored. Add
  side-channel command 'fid_sizes' which allows us to quickly enumerate
  and get sizes for files across entire devices on a storage node.

* remove all code like $state eq "readonly", $state=~ /^dead|down$/
  and instead convert it into specific questions on policy/traits
  of given state, like $dev->should_put_new_files_on, or
  $dev->should_drain_files_off. see MogileFS::DeviceState,
  objects of which are accessed via $dev->dstate, or new
  MogileFS::Util device_state($name) wrapper.

* start of rebalance support. (where replication workers, in their
  idle time, can rearrange files to even out disk space and/or IO
  activity on storage nodes... policy isn't hard-coded, and is
  in fact currently random)

* lighttpd support in both mogilefsd and mogstored. passes test
  suite with environment MOGSTORED_SERVER_TYPE=lighttpd set now.

* abstract out the HTTP server support in mogstored, so
  mogstored isn't just a perlbal wrapper, but an anything
  wrapper. (in particular, lighttpd and apache) mogstored still
  exists for all its other misc admin/monitoring functions,
  but can then manage/configure apache/lighttpd child process(es).
  so far they're just stubbed out.

* split mogstored into separate files per class, rather than one
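
Illustration (not MogileFS code): the DiskUsage change in the 2.15
notes describes rewriting a small file without ever truncating it
first, so a concurrent reader never sees a zero-length file. A Python
sketch of that sequence follows; the function name and file contents
are hypothetical, and it assumes the content is small enough to be
written by a single write call.

```python
import os


def write_usage_file(path, content):
    """Rewrite a small status file without leaving it empty in between."""
    # Open read/write WITHOUT O_TRUNC, so the old data stays in place
    # until the new data overwrites it.
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        old_size = os.fstat(fd).st_size
        # Pad with newlines so one write covers any longer, older data.
        padding = b"\n" * max(0, old_size - len(content))
        os.write(fd, content + padding)
        # Cut the padding off again now that the new content is in place.
        os.ftruncate(fd, len(content))
    finally:
        os.close(fd)


if __name__ == "__main__":
    import tempfile
    target = os.path.join(tempfile.mkdtemp(), "usage")
    write_usage_file(target, b"available: 1000000\nused: 500000\n")
    write_usage_file(target, b"used: 1\n")
    with open(target, "rb") as fh:
        print(fh.read())
```

Between the write and the truncate a reader may see trailing
newlines, but never an empty file.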
2007-04-24: Release version 2.10

* bug from earlier refactoring: changing some 'next' to 'return'
  in MogileFS::Device's create_directory method. (thanks to
  Justin Huff <jjhuff@mspin.net>)

* make sure mogdbsetup doesn't upgrade an existing table (with
  potentially a lot of data) from MyISAM to InnoDB. just warn.
  only new tables should become InnoDB on creation. also,
  detect InnoDB even on old MySQL 4.0.x, not just 4.1/5.0.

* make fsck faster by reducing number of db queries (both
  reads and writes), though more unrelated fastness remains to

* make fsck_status faster (keep summary stats of evcode counts,
  rather than doing a full GROUP BY query all the time)
2007-04-20: Release version 2.09

* filesystem checker (Fsck worker). off by default, unless
  it's enabled via protocol. Control/monitor it with mogadm
  in MogileFS-Utils 2.08, which needs MogileFS::Client 1.06.

* replication speed-up on configurations with only 1 host.
  (replication would randomly have no suggestions before, when
  there was only 1 target host, so replication would be rescheduled
  until the future, when random returned a different device on
2007-04-17: Release version 2.08

* (no runtime-visible changes from 2.07)

* mogilefsd: remove incomplete "Checker" worker, replace with
  incomplete "Fsck" worker.

* test cleanup/robustness improvements. properly skips testing when
  testing deps not found. but searches harder for them now, too.
  tests run under "make disttest".

* packaging cleanups, released using ShipIt.

* mogilefsd: Split Plugin::MetaData and Plugin::FilePaths into their own distributions.

* mogilefsd: Fix mogdbsetup to not ask empty questions.

* mogilefsd: command line/config file support for listening on specific IPs, also
  support for listening on multiple IPs/ports

* mogstored: use XS HTTPHeaders by default, unless disabled with environment variable

* mogilefsd: memcache support for get_paths (see doc/memcache-support.txt)

* mogstored: make child processes be exec'ed (as their own scripts, not inheriting
  Perlbal, Linux::AIO/IO::AIO, etc baggage). so they don't own listening
  sockets, pipes to child workers, signal handlers, etc. just a lot
  cleaner now, and SIGTERM, SIGINT properly shut everything down now,
  both in foreground and as daemon

* mogstored: this changelog is totally neglected. :/ see svn log.

* mogstored: make perlbal's "shutdown graceful" work, by not doing
  sidechannel accepting ourselves (which never hung up),
  but subclass Perlbal's TCPListener class, which does it

2006-12/2007-01: (2.00_05)
* mogilefsd: everything converted to OO API internally. old internal APIs removed.

* mogilefsd: be database-agnostic. Currently only works with MySQL (and
  SQLite as a joke/demo), but Pg/Oracle coming in the future.

* mogilefsd: built-in I/O activity monitoring between mogstored/mogilefsd Monitor

* mogilefsd: much improved internal communication between worker processes

* mogilefsd: start of better docs

* mogilefsd: pidfile support (Radu Greab <radu@yx.ro>)

* mogilefsd: remove all old internal APIs, update everything to use new OO APIs,
  expanding OO APIs where necessary (lots of places). so much more
  beautifully hackable now.

* mogstored: kill child processes (diskusage and iostat) when being killed.

* mogstored: set aio threads after daemonizing. needed by Linux::AIO/IO::AIO, it seems?

* mogilefsd: remove non-HTTP mode ("NFS mode") from the server

* mogstored: refuse to start up with missing Linux::AIO/IO::AIO or broken Linux::AIO

* mogstored: set aio_threads as a function of number of disks

* mogilefsd: major refactoring/cleanup/new features ("MogileFS 2.0")

* mogilefsd: fix dataloss bug when the 'tempfile' table is InnoDB and the
  server is restarted while the tempfile table is empty.

* mogilefsd: allow create_open to take optional fid parameter which enables you to
  dictate which fid to use internally and to not rely on MySQL's auto_increment

* mogilefsd: make Mgd::get_device_summary() always return all devices, regardless

* mogilefsd: update all callers of get_device_summary to filter on the device
  status, if they need to. (provided more clarity into what
  callers' intentions were...)

* mogilefsd: add a new device state 'readonly' which means: it can be read
  from clients, it can be replicated from, but no new files can go
  on it or be replicated onto it. and it also reaps, meaning the
  device is fucked. the filesystem probably ate itself.

* mogstored: write out usage files for devices, even if they're not on their own mount point.
  also explicitly include the '-k' flag to df, so we know they're KiB.

* mogstored: new config syntax and command line options