Makes CoreNLPClient not checks ensure_alive when start_server=StartSe… #1061

marianocrosetti · 2022-06-26T20:20:18Z

Description

Makes CoreNLPClient not check if the server is alive when start_server=StartServer.DONT_START

Fixes Issues

Unit test coverage

test_external_server renamed to test_external_server_available (and modified)
test_external_server_timeout added
test_external_server_unavailable added

pytest executed successfully on stanza/tests/server/test_client.py

Known breaking changes/behaviors

Now when start_server=StartServer.DONT_START the clients must be sure the server is running. Otherwise (if they launched a server instance but didn't wait for enough) they could get a connection error.

…rver.DONT_START Makes CoreNLPClient not checking if the server is alive when start_server=StartServer.DONT_START

AngledLuffa · 2022-06-26T23:06:32Z

stanza/server/client.py

-                raise TimeoutException(r.text)
-            else:
-                raise AnnotationException(r.text)
+        except requests.exceptions.Timeout as e:


good catch!

Thanks
I also take off the:
if r.text == "CoreNLP request timed out. Your document may be too long.":
(from below) because it has probably not sense to check r.text after a HTTP exception (probably it will not have anything)

AngledLuffa · 2022-06-26T23:06:55Z

stanza/tests/server/test_client.py

 """.strip()

+class HTTPMockServerTimeoutContext:
+    """ For lunching an HTTP server on certain port with an specified delay at responses """


Sorry! The spellchecker didn't notice (maybe) because lunching exists. Funny
I corrected with a second commit

AngledLuffa · 2022-06-26T23:12:38Z

stanza/tests/server/test_client.py


+class HTTPMockServerTimeoutContext:
+    """ For lunching an HTTP server on certain port with an specified delay at responses """
+    def __init__(self, port, timeout_secs):


i kind of want this to find an open port... but at the same time i recognize there are a lot of tests which already use 9001 in exactly the same manner

Is it ok if we leave this class as it is (with the parametrized port) and delegate the responsibility of setting the port to the caller?

AngledLuffa · 2022-06-26T23:12:56Z

stanza/tests/server/test_client.py

 [Text=. CharacterOffsetBegin=66 CharacterOffsetEnd=67 PartOfSpeech=.]
 """.strip()

+class HTTPMockServerTimeoutContext:


I like this idea a lot. Thanks!

Thanks, many people also use flask or pytest-httpserver for mocking HTTP servers. Later could be a better option but I think this doesn't introduce any extra dependency (as http.server is installed by default)

AngledLuffa · 2022-06-26T23:20:44Z

one minor change requested, plus if you can think of a way around assuming 9001 is always open, that would be great. thanks!

marianocrosetti · 2022-06-27T14:41:47Z

one minor change requested, plus if you can think of a way around assuming 9001 is always open, that would be great. thanks!

When you bind a socket and give 0 as a port number, the OS will bind it to the first available unused port.

I can refactor HTTPMockServerTimeoutContext so enter returns this port number so we can do something like:

with HTTPMockServerTimeoutContext(0, 20) as port:
    # Here we use f'http://localhost:{port}'

(The binding process occurs at HTTPServer constructor called so it's an easy refactor)

In order to solve the problem for other test cases we can:

Choose a free port doing something like:

import socket
sock = socket.socket()
sock.bind(('', 0))
sock.getsockname()[1]

Credits: SO

Notice that nothing the port remains free (it could happen a running condition, for example if tests are executed in parallalle and interfere each other). I suggest the number (2) solution.

Execute the CoreNLPServer with port=0 and find the port choosed by scanning the ports used by the process
Example of code:

import psutil

def get_port(pid):
    connections = psutil.net_connections()
    for con in connections:
        if con.pid == pid:
            if con.raddr != tuple():
                return con.raddr.port
            if con.laddr != tuple():
                return con.laddr.port
    return -1

corenlp_home = os.getenv('CORENLP_HOME')
start_cmd = 'python -m http.server 8000'
start_cmd = start_cmd and shlex.split(start_cmd)
p = subprocess.Popen(start_cmd)
print(p.pid)
time.sleep(5)
port=get_port(p.pid)

Credits: SO

I tested and on test_client.py and this solution works also for edu.stanford.nlp.pipeline.StanfordCoreNLPServer

Notice that the java edu.stanford.nlp.pipeline.StanfordCoreNLPServer ALWAYS use the port 9000 if not port is provided, so we need to provide "--port 0" (I tested and it works).

Tell me what you prefer. But I think it's better to only modify HTTPMockServerTimeoutContext on this PR. Then I can create other PR for fixing the other test cases if you need.

AngledLuffa · 2022-06-27T23:27:41Z

Since half the test uses 9001 anyway, it's not necessary to change any part of it to scan for open ports. It would be a nice TODO for later, though. I'll take another look when I get home and merge it

AngledLuffa · 2022-06-28T01:28:14Z

Thanks!

Makes CoreNLPClient not checks ensure_alive when start_server=StartSe…

b6d7c68

…rver.DONT_START Makes CoreNLPClient not checking if the server is alive when start_server=StartServer.DONT_START

marianocrosetti mentioned this pull request Jun 26, 2022

ensure_alive must not affect CoreNLPClient when init with StartServer.DONT_START #1059

Closed

AngledLuffa reviewed Jun 26, 2022

View reviewed changes

Mispelling correction

f516fd0

AngledLuffa merged commit c77f345 into stanfordnlp:dev Jun 28, 2022

Makes CoreNLPClient not checks ensure_alive when start_server=StartSe… #1061

Makes CoreNLPClient not checks ensure_alive when start_server=StartSe… #1061

Uh oh!

Conversation

marianocrosetti commented Jun 26, 2022

Description

Fixes Issues

Unit test coverage

Known breaking changes/behaviors

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AngledLuffa commented Jun 26, 2022

Uh oh!

marianocrosetti commented Jun 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AngledLuffa commented Jun 27, 2022

Uh oh!

AngledLuffa commented Jun 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

marianocrosetti commented Jun 27, 2022 •

edited

Loading