Commits · 9812e5dd7c52e67b22781a440ee04dbb2a086000 · gpt / large_projects / gitlabhq1

Aug 01, 2018

Add repository languages for projects · 79a5d768

Zeger-Jan van de Weg authored 6 years ago

Our friends at GitHub show the programming languages for a long time,
and inspired by that this commit means to create about the same
functionality.

Language detection is done through Linguist, as before, where the
difference is that we cache the result in the database. Also, Gitaly can
incrementaly scan a repository. This is done through a shell out, which
creates overhead of about 3s each run. For now this won't be improved.

Scans are triggered by pushed to the default branch, usually `master`.
However, one exception to this rule the charts page. If we're requesting
this expensive data anyway, we just cache it in the database.

Edge cases where there is no repository, or its empty are caught in the
Repository model. This makes use of Redis caching, which is probably
already loaded.

The added model is called RepositoryLanguage, which will make it harder
if/when GitLab supports multiple repositories per project. However, for
now I think this shouldn't be a concern. Also, Language could be
confused with the i18n languages and felt like the current name was
suiteable too.

Design of the Project#Show page is done with help from @dimitrieh. This
change is not visible to the end user unless detections are done.

Unverified

79a5d768

Jul 31, 2018
- Code cleaning in PostReceive services · 8402d67a
  Francisco Javier López authored 6 years ago and Nick Thomas committed 6 years ago
  
  8402d67a
Jul 30, 2018
- Create GPG commit signature in bulk · 9a81550f
  Francisco Javier López authored 6 years ago and Nick Thomas committed 6 years ago
  
  9a81550f
Jul 16, 2018
- Enable frozen string in apps/uploaders/*.rb · ddca49e4
  gfyoung authored 6 years ago
  
  Partially addresses #47424.
  ddca49e4
May 07, 2018
- Backports every CE related change from ee-5484 to CE · 9a130593
  Tiago Botelho authored 6 years ago
  
  9a130593
Feb 02, 2018
- use Gitlab::UserSettings directly as a singleton instead of including/extending it · eaada9d7
  Mario de la Ossa authored 7 years ago
  
  eaada9d7
Jan 06, 2018
- Protected branch is now created for default branch on import · 819fc98f
  Tiago Botelho authored 7 years ago
  
  819fc98f
Aug 29, 2017
- replace `is_default_branch?` with `default_branch?` · 1c0def2a
  Maxim Rydkin authored 7 years ago
  
  1c0def2a
Aug 16, 2017
- Only create commit GPG signature when necessary · ba7251fe
  Douwe Maan authored 7 years ago
  
  ba7251fe
Jul 28, 2017
- Load and process at most 100 commits when pushing into default branch · 0e355e5c
  Douwe Maan authored 7 years ago
  
  0e355e5c
Jul 27, 2017
- generate gpg signature on push · e63b693f
  Alexis Reigel authored 7 years ago
  
  e63b693f
Jun 21, 2017
- Enable Style/DotPosition Rubocop · 0430b764
  Grzegorz Bizon authored 7 years ago
  
  0430b764
Jun 13, 2017
- Update tests and application · 25b99a5b
  Kamil Trzcińśki authored 7 years ago
  
  25b99a5b
- Allow to access pipelines even if they are disabled, but only present jobs and... · a4a3da76
  Kamil Trzcińśki authored 7 years ago
  
  Allow to access pipelines even if they are disabled, but only present jobs and commit statuses without giving ability to access them
  a4a3da76
May 31, 2017
- Introduce source to pipeline entity · 161af17c
  Kamil Trzcińśki authored 7 years ago
  
  161af17c
May 09, 2017
- Don't use DiffCollection for deltas · 48254d18
  Jacob Vosmaer (GitLab) authored 7 years ago
  
  48254d18
May 04, 2017
- Use regex to skip unnecessary reference processing in ProcessCommitWorker · 020295ff
  James Edwards-Jones authored 7 years ago
  
  020295ff
Apr 03, 2017
- Moved Project#protected_branch? to ProtectedBranch, similar for tags · bf3cc824
  James Edwards-Jones authored 7 years ago
  
  bf3cc824
Mar 07, 2017
- Moved call of SystemHooksService from UpdateMergeRequestsWorker to GitPushServic… · 4bcd900f
  gpongelli authored 8 years ago
  
  4bcd900f
Feb 23, 2017

Revert "Enable Style/DotPosition" · 56de781a

Douwe Maan authored 8 years ago

This reverts commit e00fb2bdc2090e9cabeb1eb35a2672a882cc96e9.

# Conflicts:
#	.rubocop.yml
#	.rubocop_todo.yml
#	lib/gitlab/ci/config/entry/global.rb
#	lib/gitlab/ci/config/entry/jobs.rb
#	spec/lib/gitlab/ci/config/entry/factory_spec.rb
#	spec/lib/gitlab/ci/config/entry/global_spec.rb
#	spec/lib/gitlab/ci/config/entry/job_spec.rb
#	spec/lib/gitlab/ci/status/build/factory_spec.rb
#	spec/lib/gitlab/incoming_email_spec.rb

56de781a

Revert "Prefer leading style for Style/DotPosition" · 1fe7501b
Douwe Maan authored 8 years ago
```
This reverts commit cb10b725c8929b8b4460f89c9d96c773af39ba6b.
```
1fe7501b
Prefer leading style for Style/DotPosition · 206953a4
Douwe Maan authored 8 years ago

206953a4
Enable Style/DotPosition · 3dadf306
Douwe Maan authored 8 years ago

3dadf306

Dec 23, 2016

Schedule at most 100 commits · 89d3ef38

Yorick Peterse authored 8 years ago

When processing push payloads we now schedule at most the 100 most
recent commits, instead of all commits that were in a payload. This
prevents one from overloading the system by pushing thousands if not
millions of commits in a single go.

Fixes https://gitlab.com/gitlab-org/gitlab-ce/issues/25827

Verified

89d3ef38

Dec 21, 2016

Add more storage statistics · 3ef4f74b

Markus Koller authored 8 years ago

This adds counters for build artifacts and LFS objects, and moves
the preexisting repository_size and commit_count from the projects
table into a new project_statistics table.

The counters are displayed in the administration area for projects
and groups, and also available through the API for admins (on */all)
and normal users (on */owned)

The statistics are updated through ProjectCacheWorker, which can now
do more granular updates with the new :statistics argument.

Verified

3ef4f74b

Dec 01, 2016

Pass commit data to ProcessCommitWorker · 6b4d3356

Yorick Peterse authored 8 years ago

By passing commit data to this worker we remove the need for querying
the Git repository for every job. This in turn reduces the time spent
processing each job.

The migration included migrates jobs from the old format to the new
format. For this to work properly it requires downtime as otherwise
workers may start producing errors until they're using a newer version
of the worker code.

Unverified

6b4d3356

Nov 21, 2016

Refactor cache refreshing/expiring · ffb9b3ef

Yorick Peterse authored 8 years ago

This refactors repository caching so it's possible to selectively
refresh certain caches, instead of just expiring and refreshing
everything.

To allow this the various methods that were cached (e.g. "tag_count" and
"readme") use a similar pattern that makes expiring and refreshing
their data much easier.

In this new setup caches are refreshed as follows:

1. After a commit (but before running ProjectCacheWorker) we expire some
   basic caches such as the commit count and repository size.

2. ProjectCacheWorker will recalculate the commit count, repository
   size, then refresh a specific set of caches based on the list of
   files changed in a push payload.

This requires a bunch of changes to the various methods that may be
cached. For one, data should not be cached if a branch used or the
entire repository does not exist. To prevent all these methods from
handling this manually this is taken care of in
Repository#cache_method_output. Some methods still manually check for
the existence of a repository but this result is also cached.

With selective flushing implemented ProjectCacheWorker no longer uses an
exclusive lease for all of its work. Instead this worker only uses a
lease to limit the number of times the repository size is updated as
this is a fairly expensive operation.

Verified

ffb9b3ef

Nov 11, 2016
- Add service that runs after branch removed hooks · ec0daedb
  Grzegorz Bizon authored 8 years ago
  
  ec0daedb
- Do not call environments service in repository model · be3d74e0
  Grzegorz Bizon authored 8 years ago
  
  be3d74e0
Nov 09, 2016
- Stop environments for branch after branch removal · 4a7fcc2a
  Grzegorz Bizon authored 8 years ago
  
  4a7fcc2a
Nov 07, 2016

Process commits in a separate worker · 509910b8

Yorick Peterse authored 8 years ago

This moves the code used for processing commits from GitPushService to
its own Sidekiq worker: ProcessCommitWorker.

Using a Sidekiq worker allows us to process multiple commits in
parallel. This in turn will lead to issues being closed faster and cross
references being created faster. Furthermore by isolating this code into
a separate class it's easier to test and maintain the code.

The new worker also ensures it can efficiently check which issues can be
closed, without having to run numerous SQL queries for every issue.

Unverified

509910b8

Oct 13, 2016
- Extract project#update_merge_requests and SystemHooks to its own worker from GitPushService · bba47886
  Paco Guzman authored 8 years ago
  
  bba47886
Sep 20, 2016

Implement review comments from @yorickpeterse · 8957293d

Timothy Andrew authored 8 years ago

1. Change multiple updates to a single `update_all`

2. Use cascading deletes

3. Extract an average function for the database median.

4. Move database median to `lib/gitlab/database`

5. Use `delete_all` instead of `destroy_all`

6. Minor refactoring

8957293d

Sep 19, 2016

Test all cycle analytics pre-calculation code. · 8f620851

Timothy Andrew authored 8 years ago

All the code that pre-calculates metrics for use in the cycle analytics
page.

- Ci::Pipeline -> build start/finish
- Ci::Pipeline#merge_requests
- Issue -> record default metrics after save
- MergeRequest -> record default metrics after save
- Deployment -> Update "first_deployed_to_production_at" for MR metrics
- Git Push -> Update "first commit mention" for issue metrics
- Merge request create/update/refresh -> Update "merge requests closing issues"

8f620851

Sep 18, 2016

Only create a protected branch upon a push to a new branch if a rule for that branch doesn't exist · b1a4d940

Stan Hu authored 8 years ago

A customer ran into an issue where a Sidekiq task retried over and over, leading to duplicate
master branches in their protected branch list.

Closes #22177

b1a4d940

Sep 15, 2016

Improve performance of the cycle analytics page. · ba25e2f1

Timothy Andrew authored 8 years ago

1. These changes bring down page load time for 100 issues from more than
   a minute to about 1.5 seconds.

2. This entire commit is composed of these types of performance
   enhancements:

     - Cache relevant data in `IssueMetrics` wherever possible.
     - Cache relevant data in `MergeRequestMetrics` wherever possible.
     - Preload metrics

3. Given these improvements, we now only need to make 4 SQL calls:

    - Load all issues
    - Load all merge requests
    - Load all metrics for the issues
    - Load all metrics for the merge requests

4. A list of all the data points that are now being pre-calculated:

    a. The first time an issue is mentioned in a commit

      - In `GitPushService`, find all issues mentioned by the given commit
        using `ReferenceExtractor`. Set the `first_mentioned_in_commit_at`
        flag for each of them.

      - There seems to be a (pre-existing) bug here - files (and
        therefore commits) created using the Web CI don't have
        cross-references created, and issues are not closed even when
        the commit title is "Fixes #xx".

    b. The first time a merge request is deployed to production

      When a `Deployment` is created, find all merge requests that
      were merged in before the deployment, and set the
      `first_deployed_to_production_at` flag for each of them.

    c. The start / end time for a merge request pipeline

      Hook into the `Pipeline` state machine. When the `status` moves to
      `running`, find the merge requests whose tip commit matches the
      pipeline, and record the `latest_build_started_at` time for each
      of them. When the `status` moves to `success`, record the
      `latest_build_finished_at` time.

    d. The merge requests that close an issue

      - This was a big cause of the performance problems we were having
        with Cycle Analytics. We need to use `ReferenceExtractor` to make
        this calculation, which is slow when we have to run it on a large
        number of merge requests.

      - When a merge request is created, updated, or refreshed, find the
        issues it closes, and create an instance of
        `MergeRequestsClosingIssues`, which acts as a join model between
        merge requests and issues.

      - If a `MergeRequestsClosingIssues` instance links a merge request
        and an issue, that issue closes that merge request.

5. The `Queries` module was changed into a class, so we can cache the
   results of `issues` and `merge_requests_closing_issues` across
   various cycle analytics stages.

6. The code added in this commit is untested. Tests will be added in the
   next commit.

ba25e2f1

Aug 16, 2016

Fix failing tests relating to backporting ee!581. · dd3b738d

Timothy Andrew authored 8 years ago

1. `GitPushService` was still using `{merge,push}_access_level_attributes` instead
   of `{merge,push}_access_levels_attributes`.

2. The branches API creates access levels regardless of the state of the
   `developers_can_{push,merge}` parameters. This is in line with the UI,
   where Master access is the default for a new protected branch.

3. Use `after(:build)` to create access levels in the
   `protected_branches` factory, so that `factories_spec` passes. It
   only builds records, so we need to create access levels on `build` as
   well.

dd3b738d

Aug 12, 2016
- Simplify the name for data builder, feedback: · d5264e88
  Lin Jen-Shin authored 8 years ago
  
  https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/5620#note_13671791
  d5264e88
Aug 11, 2016

Pre-create all builds for Pipeline when a trigger is received · 39203f1a

Kamil Trzcińśki authored 8 years ago

This change simplifies a Pipeline processing by introducing a special new status: created.
This status is used for all builds that are created for a pipeline.
We are then processing next stages and queueing some of the builds (created -> pending) or skipping them (created -> skipped).
This makes it possible to simplify and solve a few ordering problems with how previously builds were scheduled.
This also allows us to visualise a full pipeline (with created builds).

This also removes an after_touch used for updating a pipeline state parameters.
Right now in various places we explicitly call a reload_status! on pipeline to force it to be updated and saved.

39203f1a

Aug 04, 2016
- Move those builders to their own namespace, feedback: · 984367f9
  Lin Jen-Shin authored 8 years ago
  
  https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/5620#note_13540099
  984367f9

Admin message

Admin message