description | Given a wikipedia article, build a graph of article links |
repository URL | https://github.com/dueringa/WikiWalker.git |
owner | andreas@duering-andreas.de |
last change | Thu, 19 Apr 2018 18:40:03 +0000 (19 20:40 +0200) |
last refresh | Thu, 21 Nov 2024 08:20:33 +0000 (21 09:20 +0100) |
mirror URL | git://repo.or.cz/dueringa_WikiWalker.git |
https://repo.or.cz/dueringa_WikiWalker.git | |
ssh://git@repo.or.cz/dueringa_WikiWalker.git | |
bundle info | dueringa_WikiWalker.git downloadable bundles |
content tags |
Given a Wikipedia article, build a graph of article links
Input can be any Wikipedia URL. The results can be stored in a JSON cache file. This cache file can be used in successive runs, whereas on each start the cache file is read, combined with the Wikipedia data, and stored again.
CMake is used as build system. You need curl and (boost::program_options or getopt).
Debian ships an old version of UnitTest++, which also uses a custom pkg-config file Unfortunately, this is completely incompatible with e.g. Gentoo, and the current version on GitHub, see also here.
Why would you want to do that?
You'll need
By default, cmake wants to use dynamic boost libs, but by default, boost bootstrap builds static libs only. So you need to set Boost_USE_STATIC_LIBS=ON in addition to BOOST_ROOT in cmake.
Alternatively, have a look at vcpkg, a tool to build and install libraries under Windows, and make them findable by cmake.
Since graphs can get very wide, it's recommended to unflatten
the graph first:
unflatten -l5 file.dot | dot ...
Also, have a look at Gephi.
Where 5
is the "depth" the links get distributed to.
The root directory contains a .clang-format, which can be used to reformat the
source code with clang-format. Alernatively, use the cmake target
clang-format-source
.
6 years ago | v0.3.0 | Release of version 0.3.0 | tag | commitlog |
6 years ago | v0.2.0 | Version 0.2.0 | tag | commitlog |
7 years ago | v0.1.2 | Version 0.1.2 | tag | commitlog |
7 years ago | v0.1.1 | Tag version v0.1.1. | tag | commitlog |
7 years ago | v0.1 | Tag / Release version 0.1 | tag | commitlog |
6 years ago | master | logtree |
6 years ago | feature/backwardsLinks | logtree |
6 years ago | feature/fixDesign-6 | logtree |
6 years ago | feature/curlCertBundle | logtree |
6 years ago | feature/fixDesign-5 | logtree |
6 years ago | feature/ignoreSslVerification | logtree |
6 years ago | feature/serializerWithVersion | logtree |
6 years ago | feature/serializer | logtree |
6 years ago | feature/addSourceProperty | logtree |
6 years ago | feature/addLibrary | logtree |
6 years ago | feature/fixDesign | logtree |
7 years ago | feature/objectLibrary | logtree |
7 years ago | feature/goDeeper | logtree |
8 years ago | sandbox | logtree |
8 years ago | extremelyUglyCyclicDependency | logtree |