- PVSM.RU - https://www.pvsm.ru -
Discovering git vendor
extension.
Cross-post from my medium blog: https://medium.com/opsops/git-vendor-295db4bcec3a [1]
I would like to introduce the proper way to handle vendoring of git repositories.
Vendoring is a way to integrate other’s work into your own. It’s the opposite of ‘linking’ against third-party library. Instead of having that library as a dependency, application uses this library as a part of own source code and keep that code ‘inside’ itself.
Normally, vendoring is done by language tooling: bundler, cargo, pip, etc. But sometimes you need to vendor something not covered by any existing toolset, or something multi-language, that it’s impossible to find the ‘core’ language tool for that.
The solution for this situation is vendoring on a git level. You have your own git repository (I call it ‘destination repo’), and you want to incorporate some other repository (I call it ‘source repo’) as a directory into your (destination repo).
The things you expect from a well-designed vendoring system (regardless of Git it is or not):
And, giving the git nature of Git, you want that system to be branch-friendly. If branch A have vendored code at version a1, and branch B at version b1, you want to switch between them every time you switch between A and B. Moreover, you want to be able to change version a1 to a2, and version b1 to b2 without worries about versions in another branch.
… And you want to be able to vendor more than one external repository, so vendoring shouldn’t be one-time event per repo.
As you can see it’s a long list of requirements. I analyzed existing (other) solutions before getting to the best solution (git vendor).
Copy-paste is so vicious way to vendor anything that I have nothing good to say about it. You loose provenance, visibility, updatability. You don’t loose transportability though, as there is no link to the old repo in the first place. Don’t do vendoring like that.
This is a stupid but somewhat working trick. Create a folder vendored_foobar into your repository, go into vendored_foobar and clone that foobar. Go back to the top-level and commit all changes you have.
Pros: simple to do, provide local provenance, governance and an excellent patchablity.
Cons: It’s brittle, it does not survive push (nested .git folder is not included into your repo, so for external observers vendored code is indistinguishable from your own). So you’ll loose transportability, and a provenance in a long run.
Submodules
The idea is that you have some folders of your git managed in another git. It’s the oldest ‘something’ git had provided. Unfortunately, it’s branch-unfriendly, and it lacks governance over vendored code. If remote repo is gone, you can’t use your submodules.
And don’t forget how hard is to clone this repo.
Git can use ‘subtree’ way of merging external gits as folders into the local git repository. It’s almost perfect, except for updatability, repeatability, and provenance, and visibility. Barring the manual digging into a git history, it’s impossible to see which part of the git repo is vendored and which is not. And you have no idea where those changes are coming from. If committer hasn’t wrote this, information is lost. And if s/he has, it’s not repeatable as minor changes can happens during filling that information.
So, enter the prized winner, git vendor.
Git vendor is an amazing extension for git written by Brett Langdon about three years ago. It’s just around 200 lines in bash, but it’s so well-written that I have no complains about it at all (it has everything a good program should have: manual pages, help, bash completion, reasonable error handling and failsafe guards).
It uses git subtree and extends it with functions to cover loose sides of vendoring by git subtree.
Every important point is checked:
git vendor list
and see all vendored stuff.git vendor update
, and it supports for branches, tags and commits as a way pin-point what exactly to take. And you can do git vendor remove
, of course.It has specific policy on how stuff is vendored: if you want to clone https://github.com/serverscom/dibctl
it goes into vendor/github.com/serverscom/dibctl/
. You can change ‘vendor/
’ part, but the rest is a hard policy. Symlinks can easier that, though.
There are few minor bugs there: you can’t use it on empty repos, you can’t use local gits as sources, you can’t see help until you are in git repo. None of them cause problems during normal work with real repos.
git-vendor is a perfect tool to vendor one git repo into another. It provides all required functionality for the best practices of vendoring: keeping provenance, providing visibility and updatability.
Автор: amarao
Источник [2]
Сайт-источник PVSM.RU: https://www.pvsm.ru
Путь до страницы источника: https://www.pvsm.ru/programmirovanie/310330
Ссылки в тексте:
[1] https://medium.com/opsops/git-vendor-295db4bcec3a: https://medium.com/opsops/git-vendor-295db4bcec3a
[2] Источник: https://habr.com/ru/post/442306/?utm_campaign=442306
Нажмите здесь для печати.