Commits · 24625323a826d70991fdf27fbbe1f39bee845c03 · gpt / large_projects / gitlabhq1

Jan 24, 2018
- Migrate repository bundling to Gitaly · 24625323
  Ahmad Sherif authored 7 years ago
  
  Closes gitaly#929
  24625323
Jan 22, 2018
- Move error-handling to lib/gitlab/git · 728d7e0c
  Kim "BKC" Carlbäcker authored 7 years ago
  
  728d7e0c
Jan 18, 2018
- Wrap Rugged-exceptions in Gitlab::Git::Repository#write_ref · 58aa32bc
  Kim "BKC" Carlbäcker authored 7 years ago
  
  58aa32bc
Jan 16, 2018

Fix project search results for digits surrounded by colons · 82f4564f

Sean McGivern authored 7 years ago

A file containing /:\d+:/ in its contents would break the search results if
those contents were part of the results, because we were splitting on colons,
which can't work with untrusted input.

Changing to use the null byte as a separator is much safer.

82f4564f

Move Regexp.escape(), fix formatting on tests. · b1cf3225
Andrew McCallum authored 7 years ago

b1cf3225

Jan 15, 2018
- Account for query of only forward slash(es). · a539e03d
  Andrew McCallum authored 7 years ago
  
  a539e03d
- Create intermediary variable for query value with leading slashes removed. · 7ce732fb
  Andrew McCallum authored 7 years ago
  
  7ce732fb
- Add space after comma per layout guidelines. · ae106913
  Andrew McCallum authored 7 years ago
  
  ae106913
- Strip off leading slashes when searching in context of repository. · e5afb44a
  Andrew McCallum authored 7 years ago
  
  e5afb44a
Jan 12, 2018
- Introduce PredicateMemoization cop and fix offenses · 4f00a051
  Lin Jen-Shin authored 7 years ago
  
  with StrongMemoize
  4f00a051
Jan 11, 2018
- Adds Rubocop rule for line break around conditionals · 729f05f0
  🙈 jacopo beschi 🙉 authored 7 years ago and Robert Speicher committed 7 years ago
  
  729f05f0
- Migrate Repository#can_be_merged? to Gitaly · b4b267b7
  Ahmad Sherif authored 7 years ago
  
  b4b267b7
Jan 10, 2018
- Fix hooks not being set up properly for bare import Rake task · 35d3411f
  Stan Hu authored 7 years ago
  
  Closes #41739
  35d3411f
Jan 09, 2018
- Client-prep Gitlab::Git::Repository#write_ref · 657065b7
  Kim "BKC" Carlbäcker authored 7 years ago
  
  657065b7
Jan 05, 2018

Move git operations for multi_action into Gitlab::Git · 0b07be59
Alejandro Rodríguez authored 7 years ago

0b07be59
Use --left-right and --max-count for counting diverging commits · 33c5630b
Lin Jen-Shin (godfat) authored 7 years ago

33c5630b

Backport 'Rebase' feature from EE to CE · 27a75ea1

Jan Provaznik authored 7 years ago

When a project uses fast-forward merging strategy user has
to rebase MRs to target branch before it can be merged.
Now user can do rebase in UI by clicking 'Rebase' button
instead of doing rebase locally.

This feature was already present in EE, this is only backport
of the feature to CE. Couple of changes:
* removed rebase license check
* renamed migration (changed timestamp)

Closes #40301

27a75ea1

Dec 22, 2017
- Remove unused method `remote_exists?` · 7354c9f7
  Alejandro Rodríguez authored 7 years ago
  
  7354c9f7
Dec 20, 2017
- Revert "Merge branch 'repo-write-ref-client-prep' into 'master'" · 28fba5ed
  Kim Carlbäcker authored 7 years ago
  
  This reverts merge request !15712
  28fba5ed
Dec 19, 2017

Load commit in batches for pipelines#index · c6edae38

Zeger-Jan van de Weg authored 7 years ago

Uses `list_commits_by_oid` on the CommitService, to request the needed
commits for pipelines. These commits are needed to display the user that
created the commit and the commit title.

This includes fixes for tests failing that depended on the commit
being `nil`. However, now these are batch loaded, this doesn't happen
anymore and the commits are an instance of BatchLoader.

Unverified

c6edae38

Dec 14, 2017
- sorting for tags api · e7b40c2f
  haseeb authored 7 years ago
  
  e7b40c2f
Dec 13, 2017

Adds ordering to projects contributors in API · 55f32208

Jacopo authored 7 years ago

Allows ordering in GET api/v4/projects/:project_id/repository/contributors
through `order_by` and `sort` params.
The available `order_by` options are: name|email|commits.
The available `sort` options are: asc|desc.

55f32208

Migrate Gitlab::Git::Repository#merge_base_commit to Gitaly · 835a5db3
Ahmad Sherif authored 7 years ago
```
Closes gitaly#808
```
835a5db3

Dec 12, 2017
- Avoid Gitaly N+1 calls by caching tag_names · f8c3a58a
  Stan Hu authored 7 years ago
  
  f8c3a58a
- Move Repository#write_ref to Git::Repository#write_ref · dad4c0b6
  Kim "BKC" Carlbäcker authored 7 years ago
  
  dad4c0b6
Dec 07, 2017
- Remove Rugged::Repository#empty? · 03ac8d5d
  Zeger-Jan van de Weg authored 7 years ago
  
  03ac8d5d
Dec 06, 2017
- Unify mirror addition operations to prepare for Gitaly migration · 95009cef
  Alejandro Rodríguez authored 7 years ago
  
  95009cef
Dec 01, 2017

Use commit finder instead of rev parse · 020a8482

Zeger-Jan van de Weg authored 7 years ago

This has the side effect of making this method rugged call free, which
is the reason I actually changed this.

Unverified

020a8482

Nov 23, 2017
- Rename fetch_refs to refmap · 7a1e93d3
  Douwe Maan authored 7 years ago
  
  7a1e93d3
- Clean up repository fetch and mirror methods · 0e6beaf5
  Douwe Maan authored 7 years ago
  
  0e6beaf5
- Move identical merged branch check to merged_branch_names · 7df1cb52
  Lin Jen-Shin authored 7 years ago
  
  7df1cb52
Nov 21, 2017

Use Redis cache for branch existence checks · 00cd5d93
Jacob Vosmaer (GitLab) authored 7 years ago

00cd5d93

Batchload blobs for diff generation · f9565e30

Zeger-Jan van de Weg authored 7 years ago

After installing a new gem, batch-loader, a construct can be used to
queue data to be fetched in bulk. The gem was also introduced in both
gitlab-org/gitlab-ce!14680 and gitlab-org/gitlab-ce!14846, but those mrs
are not merged yet.

For the generation of diffs, both the old blob and the new blob need to
be loaded. This for every file in the diff, too. Now we collect all
these so we do 1 fetch. Three `.allow_n_plus_1_calls` have been removed,
which I expect to be valid, but this needs to be confirmed by a full CI
run.

Possibly closes:
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37445
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37599
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37431

Unverified

f9565e30

Nov 17, 2017

Fix conflict highlighting · 64a9e53b

Sean McGivern authored 7 years ago

Conflicts used to take a `Repository` and pass that to
`Gitlab::Highlight.highlight`, which would call `#gitattribute` on the
repository. Now they use a `Gitlab::Git::Repository`, which didn't have that
method defined - but defining it on `Gitlab::Git::Repository` does make it
available on `Repository` through `method_missing`, so we can do that and both
cases will work.

64a9e53b

Nov 16, 2017
- Adds Rubocop rule for line break after guard clause · 181cd299
  Jacopo authored 7 years ago
  
  Adds a rubocop rule (with autocorrect) to ensure line break after guard clauses.
  181cd299
- Optimise getting the pipeline status of commits · ab16a6fb
  Yorick Peterse authored 7 years ago
  
  This adds an optimised way of getting the latest pipeline status for a list of Commit objects (or just a single one).
  Verified
  
  ab16a6fb
Nov 10, 2017
- Prepare Repository#fetch_source_branch for migration · de301d13
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  de301d13
Nov 07, 2017

Rewrite the GitHub importer from scratch · 4dfe26cd

Yorick Peterse authored 7 years ago

Prior to this MR there were two GitHub related importers:

* Github::Import: the main importer used for GitHub projects
* Gitlab::GithubImport: importer that's somewhat confusingly used for
  importing Gitea projects (apparently they have a compatible API)

This MR renames the Gitea importer to Gitlab::LegacyGithubImport and
introduces a new GitHub importer in the Gitlab::GithubImport namespace.
This new GitHub importer uses Sidekiq for importing multiple resources
in parallel, though it also has the ability to import data sequentially
should this be necessary.

The new code is spread across the following directories:

* lib/gitlab/github_import: this directory contains most of the importer
  code such as the classes used for importing resources.
* app/workers/gitlab/github_import: this directory contains the Sidekiq
  workers, most of which simply use the code from the directory above.
* app/workers/concerns/gitlab/github_import: this directory provides a
  few modules that are included in every GitHub importer worker.

== Stages

The import work is divided into separate stages, with each stage
importing a specific set of data. Stages will schedule the work that
needs to be performed, followed by scheduling a job for the
"AdvanceStageWorker" worker. This worker will periodically check if all
work is completed and schedule the next stage if this is the case. If
work is not yet completed this worker will reschedule itself.

Using this approach we don't have to block threads by calling `sleep()`,
as doing so for large projects could block the thread from doing any
work for many hours.

== Retrying Work

Workers will reschedule themselves whenever necessary. For example,
hitting the GitHub API's rate limit will result in jobs rescheduling
themselves. These jobs are not processed until the rate limit has been
reset.

== User Lookups

Part of the importing process involves looking up user details in the
GitHub API so we can map them to GitLab users. The old importer used
an in-memory cache, but this obviously doesn't work when the work is
spread across different threads.

The new importer uses a Redis cache and makes sure we only perform
API/database calls if absolutely necessary.  Frequently used keys are
refreshed, and lookup misses are also cached; removing the need for
performing API/database calls if we know we don't have the data we're
looking for.

== Performance & Models

The new importer in various places uses raw INSERT statements (as
generated by `Gitlab::Database.bulk_insert`) instead of using Rails
models. This allows us to bypass any validations and callbacks,
drastically reducing the number of SQL queries and Gitaly RPC calls
necessary to import projects.

To ensure the code produces valid data the corresponding tests check if
the produced rows are valid according to the model validation rules.

Verified

4dfe26cd

Nov 06, 2017
- Cache the root ref SHA in an instance variable in Repository#merged_to_root_ref? · ad937d27
  Rémy Coutable authored 7 years ago
  
  Signed-off-by: Rémy Coutable <remy@rymai.me>
  Verified
  
  ad937d27
Nov 03, 2017
- Use Gitlab::Git operations for repository mirroring · fe4874c4
  Alejandro Rodríguez authored 7 years ago
  
  fe4874c4

Admin message

Admin message