{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":429410014,"defaultBranch":"master","name":"spacy-experimental","ownerLogin":"explosion","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2021-11-18T11:51:52.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/20011530?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1698747393.0","currentOid":""},"activityList":{"items":[{"before":"63a0b0d51dce34be7b4a932324855a858e6fe672","after":"283e1d88ab952a2aa51392982b4a5bb115d112a4","ref":"refs/heads/v4","pushedAt":"2024-04-23T19:54:23.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"svlandeg","name":"Sofie Van Landeghem","path":"/svlandeg","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8796347?s=80&v=4"},"commit":{"message":"Remove the biaffine parser (#59)\n\nThe biaffine parser is now in the separate `spacy-biaffine-parser`\r\npackage: https://github.com/explosion/spacy-biaffine-parser","shortMessageHtmlLink":"Remove the biaffine parser (#59)"}},{"before":"a139a5eccdef8650c56350e66281f31f8df277b7","after":"63a0b0d51dce34be7b4a932324855a858e6fe672","ref":"refs/heads/v4","pushedAt":"2024-04-09T13:24:32.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"svlandeg","name":"Sofie Van Landeghem","path":"/svlandeg","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8796347?s=80&v=4"},"commit":{"message":"Fix Cython build for Python 3.12 (#58)\n\n* Fix Cython build for Python 3.12\r\n\r\nWhile at it:\r\n\r\n- Add CI tests for Python 3.12.\r\n- Remove 3.8 support in preparation of spaCy v4.\r\n\r\n* GH actions: ignore Metal 2.0 paravirt warning","shortMessageHtmlLink":"Fix Cython build for Python 3.12 (#58)"}},{"before":"549138fcd3dcd49fe4f02f9d5bbe387035856301","after":"66c4be536d4e69b8f6366ceb2c0ebb39079d2c89","ref":"refs/heads/master","pushedAt":"2023-11-03T19:22:18.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Update for python 3.12 (#57)\n\n* Update for python 3.12\r\n\r\n* Remove profiling\r\n* Add python 3.12 to CI\r\n\r\n* CI: Install torch separately for <3.12","shortMessageHtmlLink":"Update for python 3.12 (#57)"}},{"before":"d1293248bc6c9f07dd0db0e57748d0af148fe324","after":"549138fcd3dcd49fe4f02f9d5bbe387035856301","ref":"refs/heads/master","pushedAt":"2023-10-31T10:15:26.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"shadeMe","name":"Madeesh Kannan","path":"/shadeMe","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/214450?s=80&v=4"},"commit":{"message":"Set version to `0.6.4` (#56)","shortMessageHtmlLink":"Set version to 0.6.4 (#56)"}},{"before":"da581da6f4c5de8c63924642f7fc5f0bd281958c","after":"d1293248bc6c9f07dd0db0e57748d0af148fe324","ref":"refs/heads/master","pushedAt":"2023-10-30T16:04:53.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"shadeMe","name":"Madeesh Kannan","path":"/shadeMe","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/214450?s=80&v=4"},"commit":{"message":"Increment spaCy upper-bound to `<3.8.0` (#55)","shortMessageHtmlLink":"Increment spaCy upper-bound to <3.8.0 (#55)"}},{"before":"796d3cfef09e54630a7db305d3bf4e3ff7f828a9","after":"da581da6f4c5de8c63924642f7fc5f0bd281958c","ref":"refs/heads/master","pushedAt":"2023-07-24T12:58:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Switch from distutils to setuptools (#54)","shortMessageHtmlLink":"Switch from distutils to setuptools (#54)"}},{"before":"9e96d0a20891ab8d04c2493cae54d75110b6b425","after":"796d3cfef09e54630a7db305d3bf4e3ff7f828a9","ref":"refs/heads/master","pushedAt":"2023-07-20T09:14:05.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Update numpy build constraints for numpy 1.25 (#53)\n\nStarting in numpy 1.25 (see\r\nhttps://github.com/numpy/numpy/releases/tag/v1.25.0), the numpy C API is\r\nbackwards-compatible by default.\r\n\r\nFor python 3.9+, we should be able to drop the specific numpy build\r\nrequirements and use `numpy>=1.25`, which is currently\r\nbackwards-compatible to `numpy>=1.19`.\r\n\r\nIn the future, the python <3.9 requirements could be dropped and the\r\nlower numpy pin could correspond to the oldest supported version for the\r\ncurrent lower python pin.","shortMessageHtmlLink":"Update numpy build constraints for numpy 1.25 (#53)"}},{"before":"f9df30272224b306ce00873963ab9ce859a00d31","after":"9e96d0a20891ab8d04c2493cae54d75110b6b425","ref":"refs/heads/master","pushedAt":"2023-07-04T08:53:05.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Set version to v0.6.3 (#52)","shortMessageHtmlLink":"Set version to v0.6.3 (#52)"}},{"before":"9dd89d5ed2e4abd160f898c4a4ebae1b297ffb61","after":"f9df30272224b306ce00873963ab9ce859a00d31","ref":"refs/heads/master","pushedAt":"2023-07-04T08:09:18.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Extend to spacy v3.6 (#51)\n\n* Extend to spacy v3.6\r\n\r\n* Temporarily test with v3.6.0.dev1\r\n\r\n* Revert \"Temporarily test with v3.6.0.dev1\"\r\n\r\nThis reverts commit 208de300ffe7ee3a571a3832d28dcc08f7ab9a41.","shortMessageHtmlLink":"Extend to spacy v3.6 (#51)"}},{"before":"4bd3c08745e16df1d828a3a2a6d7e9376ad10e7d","after":"a139a5eccdef8650c56350e66281f31f8df277b7","ref":"refs/heads/v4","pushedAt":"2023-06-26T09:30:51.454Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"shadeMe","name":"Madeesh Kannan","path":"/shadeMe","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/214450?s=80&v=4"},"commit":{"message":"Make the memory use of the pairwise linear model constant (#45)\n\n* Make the memory use of the pairwise linear model constant\r\n\r\nThe Thinc part of the pairwire bilinear model is fairly simple before\r\nthis change: we would collect the splits from all documents and then pad\r\nthem. However, this caused the model to run out of memory on large docs,\r\nsince it has to compute many n*n matrices (all padded to the longest\r\nsequence length). It would also perform unnecessary computations on\r\nmany padding time steps.\r\n\r\nThis change make the memory use independent of the doc length (given a\r\nfixed split length) by doing the following:\r\n\r\n- Get all splits and flatten to a list of split representations.\r\n (`with_splits`)\r\n- Batch the splits by their padded sizes. This ensures that memory\r\n use is constant when splits have a maximum size. This also permits\r\n some buffering, so that we get more equisized batches.\r\n (`with_minibatch_by_padded_size`)\r\n- The splits in the batches are padded and passed to the Torch model.\r\n Since the outputs of the Torch model are matrices, we unpad taking\r\n this into account. (`with_pad_seq_unpad_matrix`)\r\n\r\nIn contrast to most `with_*` layers, `with_splits` is not symmetric. It\r\ntakes at its input representations for each document (`List[Floats2d]`),\r\nhowever it outputs pairwise score matrices per split. The reason is that\r\nsince the dimensions of the score matrices differ per split, we cannot\r\nconcatenate them at a document level.\r\n\r\n* Fixup pairwise bilinears backprop\r\n\r\n* Remove unused imports\r\n\r\n* Type fixes\r\n\r\n* Remove workaround for NER training not working with sentence boundaries\r\n\r\nThis needs to be fixed in the transition-based parser.\r\n\r\n* Use unflatten_matrix in ArcPredicter.predict\r\n\r\n* with_splits: more semantic naming\r\n\r\n* Factor out memoize test utility function\r\n\r\n* Documents shapes conversion function and fix a type\r\n\r\n* cupy.testing.assert_equal does not exist\r\n\r\nUse explicitly from numpy.\r\n\r\n* Fix seed in ArcLabeler/Predicter overfit tests\r\n\r\n* Fix typo\r\n\r\nCo-authored-by: Madeesh Kannan \r\n\r\n* Another typo\r\n\r\n---------\r\n\r\nCo-authored-by: Madeesh Kannan ","shortMessageHtmlLink":"Make the memory use of the pairwise linear model constant (#45)"}},{"before":"a9c67b98e7763d2e97bf4e24e02071705d52e919","after":"4bd3c08745e16df1d828a3a2a6d7e9376ad10e7d","ref":"refs/heads/v4","pushedAt":"2023-06-23T17:50:29.763Z","pushType":"pr_merge","commitsCount":7,"pusher":{"login":"danieldk","name":"Daniël de Kok","path":"/danieldk","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/49398?s=80&v=4"},"commit":{"message":"Merge pull request #50 from danieldk/sync-v4-master-20230612\n\nMerge `master` into `v4`","shortMessageHtmlLink":"Merge pull request #50 from danieldk/sync-v4-master-20230612"}},{"before":"8a59090f7a62b00859da5af09597c2a78bf362e4","after":"9dd89d5ed2e4abd160f898c4a4ebae1b297ffb61","ref":"refs/heads/master","pushedAt":"2023-04-21T14:02:43.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"svlandeg","name":"Sofie Van Landeghem","path":"/svlandeg","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8796347?s=80&v=4"},"commit":{"message":"CI: Switch tests from Azure to GHA (#48)\n\n* CI: Switch tests from Azure to GHA\r\n\r\n* Update torch install\r\n\r\n* Skip -Werror for python 3.6","shortMessageHtmlLink":"CI: Switch tests from Azure to GHA (#48)"}},{"before":"32acc1e392384b5b9479b2ae76f9c38e47b9f673","after":"a9c67b98e7763d2e97bf4e24e02071705d52e919","ref":"refs/heads/v4","pushedAt":"2023-04-18T15:59:57.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"danieldk","name":"Daniël de Kok","path":"/danieldk","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/49398?s=80&v=4"},"commit":{"message":"Replace `torch.nn.Bilinear` by our own implementation (#47)\n\n* Replace `torch.nn.Bilinear` by our own implementation\r\n\r\nTorch's bilinear layer uses the native torch `bilinear` function, which\r\nis seemingly less efficient (lots of kernel launches) than using\r\n`einsum`.\r\n\r\n* Add missing type annotations\r\n\r\n* Describe motivation for our own Bilinear implementation\r\n\r\n* Add test for Bilinear, testing against the Torch implementation\r\n\r\n* Ignore bilinear test when torch is not installed\r\n\r\n* Fix typo\r\n\r\nCo-authored-by: Madeesh Kannan \r\n\r\n---------\r\n\r\nCo-authored-by: Madeesh Kannan ","shortMessageHtmlLink":"Replace torch.nn.Bilinear by our own implementation (#47)"}},{"before":"6abf5bb64a2c387cd91fd202f50e6eee844a1f0e","after":"8a59090f7a62b00859da5af09597c2a78bf362e4","ref":"refs/heads/master","pushedAt":"2023-04-05T11:50:02.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"CI: Add GHA tests (#46)","shortMessageHtmlLink":"CI: Add GHA tests (#46)"}},{"before":"d3da095792bcd3944f849d5be9b7f66dda96c710","after":"32acc1e392384b5b9479b2ae76f9c38e47b9f673","ref":"refs/heads/v4","pushedAt":"2023-03-30T08:14:30.692Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"danieldk","name":"Daniël de Kok","path":"/danieldk","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/49398?s=80&v=4"},"commit":{"message":"arc_predicter: replace lazy splitting by iterative implementation (#43)\n\n* arc_predicter: replace lazy splitting by iterative implementation\r\n\r\nThe `split_lazily` function was defined recursively. This can cause the\r\nPython interpreter reach the maximum recursion depth. This wouldn't\r\nhappen under normal circumstances when the recursion depth is\r\nO(log2(N)). However, the softmax layer of senter is zero-initialized,\r\ngiving each token the same split probability. This results in the\r\ndegenerate case where we split recursively after each token.\r\n\r\nThis change avoids such corner cases by replacing the implementation\r\nby an iterative solution.\r\n\r\n* Remove superfluous space\r\n\r\nCo-authored-by: Madeesh Kannan \r\n\r\n* Change queue terminology to stack\r\n\r\n---------\r\n\r\nCo-authored-by: Madeesh Kannan ","shortMessageHtmlLink":"arc_predicter: replace lazy splitting by iterative implementation (#43)"}},{"before":"2dc1592e93fb6eba0f815e6270ef5566f1b7d85b","after":"d3da095792bcd3944f849d5be9b7f66dda96c710","ref":"refs/heads/v4","pushedAt":"2023-03-23T07:57:16.000Z","pushType":"pr_merge","commitsCount":5,"pusher":{"login":"danieldk","name":"Daniël de Kok","path":"/danieldk","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/49398?s=80&v=4"},"commit":{"message":"Merge pull request #44 from danieldk/chore/merge-master-20230322\n\nSync `v4` with `master`","shortMessageHtmlLink":"Merge pull request #44 from danieldk/chore/merge-master-20230322"}},{"before":"bfff1de28eebb8d2a350ae5fa417127c272e359c","after":"2dc1592e93fb6eba0f815e6270ef5566f1b7d85b","ref":"refs/heads/v4","pushedAt":"2023-03-20T17:11:58.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"adrianeboyd","name":"Adriane Boyd","path":"/adrianeboyd","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/5794899?s=80&v=4"},"commit":{"message":"Support lazy, recursive sentence splitting (#40)\n\n* Support lazy, recursive sentence splitting\r\n\r\nWe use sentence splitting in the biaffine parser to keep the O(n^2)\r\nbiaffine attention model tractable. However, since the sentence splitter\r\nmakes errors, the parser may not have the correct head available.\r\n\r\nThis change adds another splitting strategy. The goal of this strategy\r\nis to use the highest-probability splits to partition a doc until each\r\npartition is smaller than or equal to a maximum length. This reduces\r\nthe number of attachment errors as a result of incorrect sentence\r\nsplits, while providing an upper bound on complexity.\r\n\r\nThe algorithm works as follows:\r\n\r\n* If the length |d| > max_length:\r\n - Find the highest-probability split in d according to senter.\r\n - Split d into d_1 and d_2 using the highest probability split.\r\n - Recursively apply this algorithm to d_1 and d_2.\r\n* Otherwise: do nothing\r\n\r\n* `max_tokens` -> `max_length`\r\n\r\n* Remove unused `is_train` argument from `split_lazily`\r\n\r\n* Do not try to get the senter pipe\r\n\r\n* Rename `senter` option to `senter_name`\r\n\r\n* Fix example project\r\n\r\nCo-authored-by: Adriane Boyd \r\n\r\n* Formatting fixes\r\n\r\nCo-authored-by: Adriane Boyd \r\n\r\n* ArcPredicter.__init__: `Language` -> `Vocab` arg\r\n\r\n* Update spacy_experimental/biaffine_parser/arc_predicter.pyx\r\n\r\n---------\r\n\r\nCo-authored-by: Adriane Boyd ","shortMessageHtmlLink":"Support lazy, recursive sentence splitting (#40)"}},{"before":"aa91deca801ad20f9bde2a831524910dbb4a7eea","after":"bfff1de28eebb8d2a350ae5fa417127c272e359c","ref":"refs/heads/v4","pushedAt":"2023-03-10T16:57:08.064Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"svlandeg","name":"Sofie Van Landeghem","path":"/svlandeg","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8796347?s=80&v=4"},"commit":{"message":"Arc{Labeler,Predicter}: clean up argument and defaults (#41)","shortMessageHtmlLink":"Arc{Labeler,Predicter}: clean up argument and defaults (#41)"}},{"before":"745f17f1cbb06e2045add082c89fd58b426ecfca","after":"6abf5bb64a2c387cd91fd202f50e6eee844a1f0e","ref":"refs/heads/master","pushedAt":"2023-03-07T12:59:23.649Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"svlandeg","name":"Sofie Van Landeghem","path":"/svlandeg","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8796347?s=80&v=4"},"commit":{"message":"Fix all existing MyPy errors and enable MyPy in CI (#42)\n\n* Fix all existing MyPy errors and enable MyPy in CI\r\n\r\n* Fixup some more issues in `span_finder`\r\n\r\nFor some reason these didn't show up with my locally installed\r\nMyPy version.","shortMessageHtmlLink":"Fix all existing MyPy errors and enable MyPy in CI (#42)"}}],"hasNextPage":false,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEOHNzxQA","startCursor":null,"endCursor":null}},"title":"Activity · explosion/spacy-experimental"}