From 405c98a6a0e017f41f5de9c649a8f6f1b3fc4314 Mon Sep 17 00:00:00 2001 From: Patrick Steinhardt Date: Thu, 19 Mar 2026 06:33:23 +0100 Subject: contrib: move "update-unicode.sh" script into "tools/" The "update-unicode.sh" script is used to update the unicode data compiled into Git whenever a new version of the Unicode standard has been released. As such, it is a natural part of our developer-facing tooling, and its presence in "contrib/" is misleading. Promote the script into the new "tools/" directory. Signed-off-by: Patrick Steinhardt Signed-off-by: Junio C Hamano --- tools/update-unicode/README | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) create mode 100644 tools/update-unicode/README (limited to 'tools/update-unicode/README') diff --git a/tools/update-unicode/README b/tools/update-unicode/README new file mode 100644 index 0000000000..151a197041 --- /dev/null +++ b/tools/update-unicode/README @@ -0,0 +1,20 @@ +TL;DR: Run update_unicode.sh after the publication of a new Unicode +standard and commit the resulting unicode-widths.h file. + +The long version +================ + +The Git source code ships the file unicode-widths.h which contains +tables of zero and double width Unicode code points, respectively. +These tables are generated using update_unicode.sh in this directory. +update_unicode.sh itself uses a third-party tool, uniset, to query two +Unicode data files for the interesting code points. + +On first run, update_unicode.sh clones uniset from Github and builds it. +This requires a current-ish version of autoconf (2.69 works per December +2016). + +On each run, update_unicode.sh checks whether more recent Unicode data +files are available from the Unicode consortium, and rebuilds the header +unicode-widths.h with the new data. The new header can then be +committed. -- cgit v1.3-5-g9baa