Page Menu
Home
Software Heritage
Search
Configure Global Search
Log In
Files
F9341385
No One
Temporary
Actions
View File
Edit File
Delete File
View Transforms
Subscribe
Mute Notifications
Award Token
Flag For Later
Size
3 KB
Subscribers
None
View Options
diff --git a/README.md b/README.md
index ba6ff69..0dedc3a 100644
--- a/README.md
+++ b/README.md
@@ -1,119 +1,141 @@
swh-loader-svn
==============
Documents are in the ./docs folder:
- Specification: ./docs/swh-loader-svn.txt
- Comparison performance with git-svn: ./docs/comparison-git-svn-swh-svn.org
# Configuration file
## Location
Either:
- /etc/softwareheritage/
- ~/.config/swh/
- ~/.swh/
Note: Will call that location $SWH_CONFIG_PATH
## Configuration sample
$SWH_CONFIG_PATH/loader/svn.yml:
```
storage:
cls: remote
args:
url: http://localhost:5002/
check_revision: 10
```
## configuration content
With at least the following module (swh.loader.svn.tasks) and queue
(swh_loader_svn):
$SWH_CONFIG_PATH/worker.yml:
```
task_broker: amqp://guest@localhost//
task_modules:
task_modules:
- swh.loader.svn.tasks
task_queues:
- swh_loader_svn
task_soft_time_limit = 0
```
`swh.loader.svn.tasks` and `swh_loader_svn` are the important entries here.
## toplevel
+### local svn repository
+
```
$ python3
repo = 'pyang-repo-r343-eol-native-mixed-lf-crlf'
#repo = 'zipeg-gae'
origin_url = 'http://%s.googlecode.com' % repo
local_repo_path = '/home/storage/svn/repo'
svn_url = 'file://%s/%s' % (local_repo_path, repo)
import logging
logging.basicConfig(level=logging.DEBUG)
from swh.loader.svn.tasks import LoadSvnRepository
t = LoadSvnRepository()
t.run(svn_url=svn_url,
- destination_path='/tmp',
origin_url=origin_url, visit_date='2016-05-03T15:16:32+00:00',
start_from_scratch=True)
```
+### repository dump
+
+```
+$ python3
+repo = '0-512-md'
+archive_name = '%s-repo.svndump.gz' % repo
+archive_path = '/home/storage/svn/dumps/%s' % archive_name
+origin_url = 'http://%s.googlecode.com' % repo
+svn_url = 'file://%s' % repo
+
+import logging
+logging.basicConfig(level=logging.DEBUG)
+
+from swh.loader.svn.tasks import MountAndLoadSvnRepository
+
+t = MountAndLoadSvnRepository()
+t.run(archive_path=archive_path,
+ origin_url=origin_url,
+ visit_date='2016-05-03T15:16:32+00:00',
+ start_from_scratch=True)
+```
+
## Production like
start worker instance
To start a current worker instance:
```sh
python3 -m celery worker --app=swh.scheduler.celery_backend.config.app \
--pool=prefork \
--concurrency=10 \
-Ofair \
--loglevel=debug 2>&1
```
## Produce a repository to load
You can see:
`python3 -m swh.loader.svn.producer svn --help`
### one repository
```sh
python3 -u -m swh.loader.svn.producer svn --svn-url file:///home/storage/svn/repos/pkg-fox --visit-date 'Tue, 3 May 2017 17:16:32 +0200'
```
Note:
- `--visit-date` to override the default visit-date to now.
### multiple repositories
```sh
cat ~/svn-repository-list | python3 -m swh.loader.svn.producer svn
```
The file svn-repository-list contains a list of svn repository urls
(one per line), something like:
```txt
svn://svn.debian.org/svn/pkg-fox/ optional-url
svn://svn.debian.org/svn/glibc-bsd/ optional-url
svn://svn.debian.org/svn/pkg-voip/ optional-url
svn://svn.debian.org/svn/python-modules/ optional-url
svn://svn.debian.org/svn/pkg-gnome/ optional-url
```
## Produce archive of svndumps list to load
see. `python3 -m swh.loader.svn.producer svn-archive --help`
-
File Metadata
Details
Attached
Mime Type
text/x-diff
Expires
Fri, Jul 4, 11:59 AM (3 w, 2 d ago)
Storage Engine
blob
Storage Format
Raw Data
Storage Handle
3291776
Attached To
rDLDSVN Subversion (SVN) loader
Event Timeline
Log In to Comment