Commit Graph

  • 2731e0bdd8 Update the Classifier page links and formatting John Bauer 2023-11-04 20:38:33 -07:00
  • 08a90d7de7 This formatting looks good for the options John Bauer 2023-11-04 20:32:40 -07:00
  • a937288c11 A couple possible updates to the formatting to make it look better John Bauer 2023-11-04 20:31:05 -07:00
  • 2c7e6f14ff Update links to use the .ppt files in the gh-pages part of the repo John Bauer 2023-11-04 20:29:11 -07:00
  • 365a2c3f80 Add the tregex etc. ppt files to the gh-pages John Bauer 2023-11-04 20:27:21 -07:00
  • 356aef8514 More formatting. Include a link to the Ssurgeon javadoc John Bauer 2023-11-04 20:22:01 -07:00
  • a7ea57268d Fix e in Chloé John Bauer 2023-11-04 20:15:18 -07:00
  • 5d6ffd67f4 Add some notes on Ssurgeon and a link to the GURT paper John Bauer 2023-11-04 20:14:42 -07:00
  • 25f04ead14 Comma update John Bauer 2023-11-04 17:16:17 -07:00
  • 298b7fc238 Oops John Bauer 2023-11-04 17:14:36 -07:00
  • 4759dd909b Initial version of scenegraph page John Bauer 2023-11-04 17:12:42 -07:00
  • d90d2f57cb Fix up release history table for NER John Bauer 2023-11-04 17:07:37 -07:00
  • 1a4b6b578d Some formatting fixes for CRF-FAQ John Bauer 2023-11-04 17:07:03 -07:00
  • d85f54d907 release history table was messed up John Bauer 2023-11-04 17:01:20 -07:00
  • c97f5d56bb Heading size changes, whitespace changes for the NER faq John Bauer 2023-11-04 17:00:45 -07:00
  • 23109a999b Update a few links in the NER table John Bauer 2023-11-04 16:55:57 -07:00
  • 0f338a3c27 Oops John Bauer 2023-11-04 16:53:07 -07:00
  • 05e28f0461 Initial versions of the CRF John Bauer 2023-11-04 16:50:46 -07:00
  • 02badb46ba Updates - no double headings, add a TOC John Bauer 2023-11-04 16:44:31 -07:00
  • f5bde8a080 Attempt to fix this link? John Bauer 2023-11-04 16:43:02 -07:00
  • 324e68f86b Update to the tregex faq page John Bauer 2023-11-04 16:39:17 -07:00
  • 1558850980 Fix the table at the bottom of the tregex page, other fixes John Bauer 2023-11-04 16:36:03 -07:00
  • e6af296773 A couple more minor fixes John Bauer 2023-11-04 16:29:52 -07:00
  • 0ab510864a A few manual fixes to the Classifier page John Bauer 2023-11-04 16:28:00 -07:00
  • a626579952 Rough draft of Classifier conversion John Bauer 2023-11-04 16:19:19 -07:00
  • f55b975343 Oops, my bad John Bauer 2023-11-04 15:53:12 -07:00
  • d8fab157bc Candidate conversion of the tregex faq John Bauer 2023-11-04 15:51:34 -07:00
  • edd439fc96 Update with links John Bauer 2023-11-04 15:50:34 -07:00
  • 2528ec68f9 Add a candidate conversion of the Tregex page John Bauer 2023-11-04 15:40:28 -07:00
  • 4d91ec8d78 Add a lowercase form of "-updateMorphoFeatures" for use in EditNode John Bauer 2023-11-03 12:52:31 -07:00
  • 861f72f6cd Add a test to check that the -updateMorphoFeatures EditNode option works John Bauer 2023-10-29 00:31:52 -07:00
  • a37afbb94e Add a block testing the morpho feature capability of Ssurgeon EditNode John Bauer 2023-10-29 00:28:39 -07:00
  • f77a9b414c Sort morphological features in lowercase alphabetical order. Better fits with what is expected in UD datasets John Bauer 2023-10-27 21:46:47 -07:00
  • cb50801f82 Store the CoNNLU features in a dedicated class. This class has a toString() which outputs in the format expected by the CoNLLU files. The big advantage of doing this is that it makes Semgrex recognize the features in the format of UD files John Bauer 2023-10-26 08:48:30 -07:00
  • ec5d719e5d Morphological features need to be in a TreeMap to keep them sorted by key John Bauer 2023-10-26 00:28:01 -07:00
  • 27c6703b5f Add to Ssurgeon the ability to update features without overwriting the whole feature map John Bauer 2023-10-25 23:23:14 -07:00
  • 0e35efe2c8 Fix SsurgeonTest doc John Bauer 2023-10-28 21:38:54 -07:00
  • 3a0bb1e654 These look incorrect - probably a copy/paste error John Bauer 2023-10-26 08:27:47 -07:00
  • c03522b59b Remove the part of the CoreLabel API which removes the lemma when the word is updated. It was kinda annoying and a lot of people seem to dislike it. John Bauer 2023-04-05 12:53:23 -07:00
  • 6707a52c19 Add emptyIndex to the output format for CoreLabel and IndexedWord if they use one of the INDEX outputs. No effect for words which don't have those annotations, which will be most words outside of Semgrex usage John Bauer 2023-10-25 22:41:34 -07:00
  • 038fde260a Make an error message more descriptive John Bauer 2023-10-24 12:19:11 -07:00
  • e08eded765 Add documentation on the morphoFeatures when updating the morphological features using Ssurgeon. Currently, there is no way to update features without wiping out the entire features map. This could be changed if needed John Bauer 2023-10-24 12:18:58 -07:00
  • 6bfbb0f5f1 Language.Unknown can be treated as a generic Universal language when splitting a GrammaticRelation in text form to make a new relation. After all, the code that puts it back together will use : as the separator anyway for Unknown John Bauer 2023-10-24 12:17:01 -07:00
  • 8c345f9896 A missing line of doc in the Ssurgeon documentation John Bauer 2023-10-17 20:13:28 -07:00
  • 03e4542054 Flesh out this exception a bit more so we get a better idea of what's going wrong John Bauer 2023-10-16 15:41:15 -07:00
  • eb50467fa8 Minor whitespace updates to the proto John Bauer 2023-10-16 00:07:34 -07:00
  • fa059831d5 Allow for reading the tokens directly off the DependencyGraph in the protobuf semgrex processing John Bauer 2023-10-16 00:05:15 -07:00
  • 88a997b2f8 Simplify - no need to build a list out of these items before converting John Bauer 2023-10-15 09:21:51 -07:00
  • da4771542f Add a mechanism where roots can be passed even with emptyIndex. The old version still works, in case there are legacy systems out there or old serialized graphs John Bauer 2023-10-14 00:50:27 -07:00
  • c9a77c7489 Add a basic test of the dependency graph serialization for ProtobufAnnotationSerializer John Bauer 2023-10-13 22:12:59 -07:00
  • 941c3b91f2 Use the EmptyIndex fields in the DependencyGraph proto to build a SemanticGraph with the fake nodes used in UD John Bauer 2023-10-13 09:26:41 -07:00
  • d36e2ec6a8 Build a Map from index to CoreLabel John Bauer 2023-10-12 22:52:21 -07:00
  • 92e0762d01 Add emptyIndex as optional fields to the nodes and edges of the DependencyGraph proto. Will allow for the passing of UD graphs with fake words John Bauer 2023-10-13 09:29:38 -07:00
  • 249ee23fca Add index and emptyIndex as optional fields to the Token protobuf. This will be especially useful when passing around dependency graphs with emptyIndex John Bauer 2023-10-12 22:36:30 -07:00
  • a9d3b88064 Create an exception type for a failed serialization / deserialization in the ProtobufAnnotationSerializer John Bauer 2023-10-13 21:53:26 -07:00
  • 091fdc955f Update the xml string output to encorporate EmptyIndex John Bauer 2023-10-13 19:40:13 -07:00
  • e0a3807454 Add .emptyIndex to Semgraph valueOf John Bauer 2023-10-13 12:54:20 -07:00
  • 2819c8e1a5 Check that the indices are being parsed from the SemanticGraph the same way they are being labeled in the valueOf test John Bauer 2023-10-13 17:20:13 -07:00
  • ed0f2de8a7 Oops, need to add emptyIndex to the compareTo method as well John Bauer 2023-10-13 14:24:23 -07:00
  • 693ab2d266 Convert SemanticGraphTest to a newer junit version John Bauer 2023-10-13 13:35:01 -07:00
  • 7c53fb46d8 Move the EmptyIndex into the CoreLabel, as that may be the easiest way to pass it from Python to Java in the Semgrex interface John Bauer 2023-10-12 21:38:08 -07:00
  • 4e3721325d Need to compare EmptyIndex as well when looking at isCopy John Bauer 2023-10-12 21:32:37 -07:00
  • c52438c851 Doc fix John Bauer 2023-10-12 21:27:59 -07:00
  • b9f828e4ed Oops, this should not have any argument John Bauer 2023-10-12 21:20:32 -07:00
  • b65c249b95 When testing the Dependency Converter interface, need to use indices from the original tree starting from 1 instead of 0 John Bauer 2023-10-13 09:03:02 -07:00
  • 9b3f80ec74 Get rid of some .* imports from NERServlet John Bauer 2023-10-08 12:04:20 -07:00
  • 8a671fd065 Update to jakarta servlet John Bauer 2023-10-08 11:52:04 -07:00
  • aa37ebf16f Apparently old versions of java 1.8 don't have strip() John Bauer 2023-10-09 18:44:07 -07:00
  • e1669fcbcc Add a feature to connect a text file to the trees being converted. If used, the raw text will be lined up to find the SpaceAfter. John Bauer 2023-10-04 21:07:15 -07:00
  • 2452d2192c Whitespace John Bauer 2023-10-03 08:16:52 -07:00
  • 1d7aa28c5e Small Javadoc fix John Bauer 2023-10-03 08:13:06 -07:00
  • f8838d2639 Need to add xalan/serializer to the pom files as well John Bauer 2023-09-18 07:25:47 -07:00
  • 082e444cc1 Add a small block of documentation on searching for things with no text John Bauer 2023-09-16 16:36:41 -07:00
  • 3a71d443d0 Update builds and poms to use the new xom & bug-free xalan John Bauer 2023-09-15 17:58:18 -07:00
  • b47b9e9e90 Fix javadoc error John Bauer 2023-09-15 09:18:55 -07:00
  • eacfead8c2 Fix a few more Javadoc errors. Try <strong> instead of <h1> in Tsurgeon John Bauer 2023-09-15 09:13:54 -07:00
  • 01df1c47e8 Fix a bunch of javadoc errors John Bauer 2023-09-15 09:01:55 -07:00
  • ea2d6ef8c7 Fix javadoc errors John Bauer 2023-09-15 08:36:20 -07:00
  • 0352e783db Fix some Ssurgeon javadoc errors John Bauer 2023-09-15 08:35:21 -07:00
  • a4d2aac9dc Minor javadoc fix John Bauer 2023-09-15 08:29:00 -07:00
  • 9975a90630 One of these assertions is no longer necessary, and was messing up intellij unit tests. The others are still valid, so turn them into things that are always checked John Bauer 2023-09-12 23:19:53 -07:00
  • 1a5cea0240 Update CoreNLP to a new version - Fix up some SD and UD conversion errors. Add SceneGraph to the server. Fix Tregex optional bug. fourty and forty (40) days in SUTime. John Bauer 2023-09-06 08:19:54 -07:00
  • b5a632c8de Actually, leave a couple mvn links at 4.5.4 for now v4.5.5 John Bauer 2023-09-06 07:59:46 -07:00
  • 3b92fe0a1b Update readme for CoreNLP John Bauer 2023-09-06 07:58:31 -07:00
  • 4cf0cffc25 Automatic updates of the various pom files etc for the next version John Bauer 2023-09-06 07:56:10 -07:00
  • 53c0d4bb95 Javadoc lists? John Bauer 2023-09-06 01:52:17 -07:00
  • b3c47a0539 make quantity in parenthesis non capture group J38 2023-09-05 20:22:16 -07:00
  • 010bfadf9c update duration rule to allow for optional parenthesis J38 2023-09-05 06:06:21 -07:00
  • 0f69578bdb Add an EmptyIndexAnnotation to account for UD sentences with extra words between words (that is how UD handles copy words in enhanced dependencies, for example) John Bauer 2023-04-26 11:53:28 -07:00
  • de026a38a6 This SuppressWarnings appears to be unnecessary John Bauer 2023-08-30 22:38:53 -07:00
  • 7fbb7b81d3 Accept fourty as 40... probably can't mean anything else, after all John Bauer 2023-08-30 19:24:14 -07:00
  • d5daf7cec6 Clarify an error which might occur when redoing the number test John Bauer 2023-08-30 19:20:23 -07:00
  • 15c2eaf410 This WTS should be unnecessary John Bauer 2023-08-30 19:05:46 -07:00
  • 45b4f6941c Add a useful comment from the commit message Christopher Manning 2023-08-30 11:55:54 -07:00
  • 02644ce164 Fix a comment John Bauer 2023-08-29 14:34:04 -07:00
  • 897231bed0 Remove the shell usage of bz2, which could be a security problem if someone else hijacks the system's bz2 installation. Addresses https://nvd.nist.gov/vuln/detail/CVE-2023-39020 John Bauer 2023-08-29 11:38:36 -07:00
  • 4c2e925eb5 Copyedit Christopher Manning 2023-08-23 17:52:13 -07:00
  • 825c54c494 Merge remote-tracking branch 'refs/remotes/origin/gh-pages' into gh-pages Christopher Manning 2023-08-23 17:45:53 -07:00
  • 65fe225a21 Update docker recommendations Christopher Manning 2023-08-23 17:44:53 -07:00
  • cb900f1fe1 Weird whitespace John Bauer 2023-08-15 14:15:13 -07:00