Commits · 24625323a826d70991fdf27fbbe1f39bee845c03 · gpt / large_projects / gitlabhq1

Jan 24, 2018
- Migrate repository bundling to Gitaly · 24625323
  Ahmad Sherif authored 7 years ago
  
  Closes gitaly#929
  24625323
Jan 18, 2018
- Remove Rugged::Walker from Repository#log · fa520387
  Jacob Vosmaer (GitLab) authored 7 years ago and Douwe Maan committed 7 years ago
  
  fa520387
Jan 17, 2018
- Remove unused methods from Gitlab::Git · f4de7309
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  f4de7309
Jan 11, 2018
- Migrate merged_branch_names to Gitaly · 83101941
  Ahmad Sherif authored 7 years ago
  
  Closes gitaly#851
  83101941
Jan 05, 2018
- Incorporate RemoteService.FetchInternalRemote Gitaly RPC · 9ff44c29
  Alejandro Rodríguez authored 7 years ago
  
  9ff44c29
- Use --left-right and --max-count for counting diverging commits · 33c5630b
  Lin Jen-Shin (godfat) authored 7 years ago
  
  33c5630b
Jan 04, 2018
- Move delete_remote_branches from Gitlab::Shell to Gitlab::Git::Repository · 1c458d17
  Alejandro Rodríguez authored 7 years ago
  
  1c458d17
- Move push_remote_branches from Gitlab::Shell to Gitlab::Git::Repository · 43308bd8
  Alejandro Rodríguez authored 7 years ago
  
  43308bd8
Jan 03, 2018
- Better English · 449b59ec
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  449b59ec
- Handle Gitaly aborted merge due to branch update · 8cf0ea44
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  8cf0ea44
Dec 27, 2017
- Add support for max_count option to Git::Repository#count_commits · 13932b0b
  Ahmad Sherif authored 7 years ago
  
  13932b0b
Dec 22, 2017
- Incorporate Gitaly's RemoteService RPCs · 2694355b
  Alejandro Rodríguez authored 7 years ago
  
  2694355b
- Remove unused method `remote_exists?` · 7354c9f7
  Alejandro Rodríguez authored 7 years ago
  
  7354c9f7
- Replace '.team << [user, role]' with 'add_role(user)' in specs · 27c95364
  blackst0ne authored 7 years ago
  
  27c95364
Dec 14, 2017

Import gitlab_projects.rb from gitlab-shell · 4b785df2

Nick Thomas authored 7 years ago

By importing this Ruby code into gitlab-rails (and gitaly-ruby), we avoid
200ms of startup time for each gitlab_projects subprocess we are eliminating.

By not having a gitlab_projects subprocess between gitlab-rails / sidekiq and
any git subprocesses (e.g. for fork_project, fetch_remote, etc, calls), we can
also manage these git processes more cleanly, and avoid sending SIGKILL to them

Verified

4b785df2

Dec 07, 2017
- Remove Rugged::Repository#empty? · 03ac8d5d
  Zeger-Jan van de Weg authored 7 years ago
  
  03ac8d5d
Dec 06, 2017
- Add feature flag to use gitaly-ssh mirroring when cloning internal repos · 885a4da2
  Alejandro Rodríguez authored 7 years ago
  
  This also allows us to simplify the naming since we can make some fetching methods private.
  885a4da2
Dec 01, 2017
- Gracefully handle case when repository's root ref does not exist · 66127221
  Stan Hu authored 7 years ago
  
  This was failing regularly with an Error 500 when the API branches endpoint was used. Closes #40615
  66127221
Nov 23, 2017
- Clean up repository fetch and mirror methods · 0e6beaf5
  Douwe Maan authored 7 years ago
  
  0e6beaf5
- Make sure repository is restored · 59513200
  Lin Jen-Shin authored 7 years ago
  
  59513200
- Move identical merged branch check to merged_branch_names · 7df1cb52
  Lin Jen-Shin authored 7 years ago
  
  7df1cb52
Nov 20, 2017
- Fix Gitlab::Git::Repository#remote_tags using unexisting variable · 3f0c9e97
  Alejandro Rodríguez authored 7 years ago
  
  3f0c9e97
Nov 17, 2017
- Incorporate Gitaly's RefService.DeleteRefs RPC · 38730a2d
  Alejandro Rodríguez authored 7 years ago
  
  38730a2d
Nov 10, 2017
- Prepare Repository#fetch_source_branch for migration · de301d13
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  de301d13
Nov 07, 2017

Rewrite the GitHub importer from scratch · 4dfe26cd

Yorick Peterse authored 7 years ago

Prior to this MR there were two GitHub related importers:

* Github::Import: the main importer used for GitHub projects
* Gitlab::GithubImport: importer that's somewhat confusingly used for
  importing Gitea projects (apparently they have a compatible API)

This MR renames the Gitea importer to Gitlab::LegacyGithubImport and
introduces a new GitHub importer in the Gitlab::GithubImport namespace.
This new GitHub importer uses Sidekiq for importing multiple resources
in parallel, though it also has the ability to import data sequentially
should this be necessary.

The new code is spread across the following directories:

* lib/gitlab/github_import: this directory contains most of the importer
  code such as the classes used for importing resources.
* app/workers/gitlab/github_import: this directory contains the Sidekiq
  workers, most of which simply use the code from the directory above.
* app/workers/concerns/gitlab/github_import: this directory provides a
  few modules that are included in every GitHub importer worker.

== Stages

The import work is divided into separate stages, with each stage
importing a specific set of data. Stages will schedule the work that
needs to be performed, followed by scheduling a job for the
"AdvanceStageWorker" worker. This worker will periodically check if all
work is completed and schedule the next stage if this is the case. If
work is not yet completed this worker will reschedule itself.

Using this approach we don't have to block threads by calling `sleep()`,
as doing so for large projects could block the thread from doing any
work for many hours.

== Retrying Work

Workers will reschedule themselves whenever necessary. For example,
hitting the GitHub API's rate limit will result in jobs rescheduling
themselves. These jobs are not processed until the rate limit has been
reset.

== User Lookups

Part of the importing process involves looking up user details in the
GitHub API so we can map them to GitLab users. The old importer used
an in-memory cache, but this obviously doesn't work when the work is
spread across different threads.

The new importer uses a Redis cache and makes sure we only perform
API/database calls if absolutely necessary.  Frequently used keys are
refreshed, and lookup misses are also cached; removing the need for
performing API/database calls if we know we don't have the data we're
looking for.

== Performance & Models

The new importer in various places uses raw INSERT statements (as
generated by `Gitlab::Database.bulk_insert`) instead of using Rails
models. This allows us to bypass any validations and callbacks,
drastically reducing the number of SQL queries and Gitaly RPC calls
necessary to import projects.

To ensure the code produces valid data the corresponding tests check if
the produced rows are valid according to the model validation rules.

Verified

4dfe26cd

Nov 03, 2017
- Encapsulate git operations for mirroring in Gitlab::Git · dea6d054
  Alejandro Rodríguez authored 7 years ago
  
  dea6d054
- Add `Gitlab::Git::Repository#fetch` command · 88d2517e
  Alejandro Rodríguez authored 7 years ago
  
  88d2517e
- removed the #ensure_ref_fetched from all controllers · cd88fa8f
  micael.bergeron authored 7 years ago
  
  also, I refactored the MergeRequest#fetch_ref method to express the side-effect that this method has. MergeRequest#fetch_ref -> MergeRequest#fetch_ref! Repository#fetch_source_branch -> Repository#fetch_source_branch!
  cd88fa8f
Nov 02, 2017
- Fix encoding issue with Repository.ls_files · d6066870
  Kim Carlbäcker authored 7 years ago and Douwe Maan committed 7 years ago
  
  d6066870
Nov 01, 2017

Detect changes to LFS pointers for pruning and integrity check · fb3f9c6e

James Edwards-Jones authored 7 years ago

Gitlab::Git::Blob.batch_lfs_metadata can be used to check for LFS pointers. It uses a lazy enumorator and filters by blob size

fb3f9c6e

Oct 31, 2017
- Incorporate Gitaly's OperationService.UserFFBranch RPC · 37cc50f8
  Alejandro Rodríguez authored 7 years ago
  
  37cc50f8
Oct 27, 2017
- Fetch the merged branches at once · 57d7ed05
  Lin Jen-Shin (godfat) authored 7 years ago
  
  57d7ed05
Oct 24, 2017
- Move all rugged operation for ff_merge inside Gitlab::Git · a64601b9
  Alejandro Rodríguez authored 7 years ago
  
  We also delete some unused code related to the aforementioned feature.
  a64601b9
Oct 13, 2017
- Merge Merge Requests via Gitaly · 0aff29f9
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  0aff29f9
Oct 12, 2017
- Fix the format of rugged alternate directory list · a24abf39
  Ahmad Sherif authored 7 years ago
  
  Fixes #39046
  a24abf39
Oct 11, 2017
- Pass git object dir attributes as relative paths to Gitaly · 4378f56c
  Ahmad Sherif authored 7 years ago
  
  Fixes gitaly#629
  4378f56c
Oct 09, 2017
- Add `Gitlab::Git::Repository#fetch` command · 17319343
  Alejandro Rodríguez authored 7 years ago
  
  17319343
Oct 07, 2017

Replaces `tag: true` into `:tag` in the specs · 0ce67858

Jacopo authored 7 years ago

Replaces all the explicit include metadata syntax in the specs (tag:
true) into the implicit one (:tag).
Added a cop to prevent future errors and handle autocorrection.

0ce67858

Oct 04, 2017
- Let fetch_ref pull from Gitaly instead of from disk · 147e2b21
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  147e2b21
- Add OperationService.UserDeleteBranch Gitaly RPC · 79719cf0
  Alejandro Rodríguez authored 7 years ago
  
  79719cf0

Admin message

Admin message