Commit 4ef5028

Merge pull request #7 from facebookresearch/master
Merge upstream
2 parents 50d21a0 + 25d0bb0 commit 4ef5028

19 files changed: 293 additions and 116 deletions

.circleci/config.yml

Lines changed: 60 additions & 60 deletions
@@ -153,59 +153,59 @@ jobs:
           command: |
             . .circleci/gcc_test.sh
 
-  "debian-wheezy-gcc":
-    docker:
-      - image: debian:wheezy
-    working_directory: ~/repo
-    steps:
-      - checkout
-      - run:
-          command: |
-            . .circleci/setup_debian.sh
-            . .circleci/gcc_test.sh
-
-  "debian-wheezy-cmake":
-    docker:
-      - image: debian:wheezy
-    working_directory: ~/repo
-    steps:
-      - checkout
-      - run:
-          command: |
-            . .circleci/setup_debian.sh
-            . .circleci/cmake_test.sh
-
-  "debian-jessie-gcc":
-    docker:
-      - image: debian:jessie
-    working_directory: ~/repo
-    steps:
-      - checkout
-      - run:
-          command: |
-            . .circleci/setup_debian.sh
-            . .circleci/gcc_test.sh
-
-  "debian-jessie-cmake":
-    docker:
-      - image: debian:jessie
-    working_directory: ~/repo
-    steps:
-      - checkout
-      - run:
-          command: |
-            . .circleci/setup_debian.sh
-            . .circleci/cmake_test.sh
-
-  "debian-jessie-python":
-    docker:
-      - image: debian:jessie
-    working_directory: ~/repo
-    steps:
-      - checkout
-      - run:
-          command: |
-            . .circleci/setup_debian.sh
+  "debian-wheezy-gcc":
+    docker:
+      - image: debian:wheezy
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          command: |
+            . .circleci/setup_debian.sh
+            . .circleci/gcc_test.sh
+
+  "debian-wheezy-cmake":
+    docker:
+      - image: debian:wheezy
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          command: |
+            . .circleci/setup_debian.sh
+            . .circleci/cmake_test.sh
+
+  "debian-jessie-gcc":
+    docker:
+      - image: debian:jessie
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          command: |
+            . .circleci/setup_debian.sh
+            . .circleci/gcc_test.sh
+
+  "debian-jessie-cmake":
+    docker:
+      - image: debian:jessie
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          command: |
+            . .circleci/setup_debian.sh
+            . .circleci/cmake_test.sh
+
+  "debian-jessie-python":
+    docker:
+      - image: debian:jessie
+    working_directory: ~/repo
+    steps:
+      - checkout
+      - run:
+          command: |
+            . .circleci/setup_debian.sh
             pip install .
             python runtests.py -u
 
@@ -217,10 +217,10 @@ jobs:
       - checkout
       - run:
           command: |
-            git config --global user.email "cpuhrsch@users.noreply.github.com"
+            git config --global user.email "docusaurus-bot@users.noreply.github.com"
             git config --global user.name "Website Deployment Script"
-            echo "machine github.com login cpuhrsch password $GITHUB_TOKEN" > ~/.netrc
-            cd website && npm install && GIT_USER=cpuhrsch npm run publish-gh-pages
+            echo "machine github.com login docusaurus-bot password $GITHUB_TOKEN_DOCUSAURUS_BOT" > ~/.netrc
+            cd website && npm install && GIT_USER=docusaurus-bot npm run publish-gh-pages
 
 workflows:
   version: 2
@@ -239,8 +239,8 @@ workflows:
       - "gcc7"
       - "gcclatest"
       - "website-build"
-      - "debian-wheezy-gcc"
-      - "debian-wheezy-cmake"
-      - "debian-jessie-gcc"
-      - "debian-jessie-cmake"
+      - "debian-wheezy-gcc"
+      - "debian-wheezy-cmake"
+      - "debian-jessie-gcc"
+      - "debian-jessie-cmake"
       - "debian-jessie-python"

README.md

Lines changed: 15 additions & 9 deletions
@@ -42,7 +42,7 @@ This repository hosts unofficial Windows binary builds of fastText, a library fo
 
 ### Models
 - Recent state-of-the-art [English word vectors](https://fasttext.cc/docs/en/english-vectors.html).
-- Word vectors for [294 languages trained on Wikipedia](https://github.com/facebookresearch/fastText/blob/master/pretrained-vectors.md).
+- Word vectors for [157 languages trained on Wikipedia and Crawl](https://github.com/facebookresearch/fastText/blob/master/docs/crawl-vectors.md).
 - Models for [language identification](https://fasttext.cc/docs/en/language-identification.html#content) and [various supervised tasks](https://fasttext.cc/docs/en/supervised-models.html#content).
 
 ### Supplementary data
@@ -104,7 +104,7 @@ $ unzip v0.1.0.zip
 $ cd fastText-0.1.0
 $ make
 ```
-
+
 This will produce object files for all the classes as well as the main binary `fasttext`.
 If you do not plan on using the default system-wide compiler, update the two macros defined at the beginning of the Makefile (CC and INCLUDES).
 
@@ -233,7 +233,7 @@ This will create a `.ftz` file with a smaller memory footprint. All the standard
 ```
 $ ./fasttext test model.ftz test.txt
 ```
-The quantization procedure follows the steps described in [3](#fastext-zip). You can
+The quantization procedure follows the steps described in [3](#fasttextzip-compressing-text-classification-models). You can
 run the script `quantization-example.sh` for an example.
 
 
@@ -293,11 +293,14 @@ Please cite [1](#enriching-word-vectors-with-subword-information) if using this
 [1] P. Bojanowski\*, E. Grave\*, A. Joulin, T. Mikolov, [*Enriching Word Vectors with Subword Information*](https://arxiv.org/abs/1607.04606)
 
 ```
-@article{bojanowski2016enriching,
+@article{bojanowski2017enriching,
   title={Enriching Word Vectors with Subword Information},
   author={Bojanowski, Piotr and Grave, Edouard and Joulin, Armand and Mikolov, Tomas},
-  journal={arXiv preprint arXiv:1607.04606},
-  year={2016}
+  journal={Transactions of the Association for Computational Linguistics},
+  volume={5},
+  year={2017},
+  issn={2307-387X},
+  pages={135--146}
 }
 ```
 
@@ -306,11 +309,14 @@ Please cite [1](#enriching-word-vectors-with-subword-information) if using this
 [2] A. Joulin, E. Grave, P. Bojanowski, T. Mikolov, [*Bag of Tricks for Efficient Text Classification*](https://arxiv.org/abs/1607.01759)
 
 ```
-@article{joulin2016bag,
+@InProceedings{joulin2017bag,
   title={Bag of Tricks for Efficient Text Classification},
   author={Joulin, Armand and Grave, Edouard and Bojanowski, Piotr and Mikolov, Tomas},
-  journal={arXiv preprint arXiv:1607.01759},
-  year={2016}
+  booktitle={Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers},
+  month={April},
+  year={2017},
+  publisher={Association for Computational Linguistics},
+  pages={427--431},
 }
 ```
 