-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deterministic arrival and service queue not producing expected number of events #260
Comments
Hi! I'm just going to address the first example (due to time constraints). @geraintpalmer would be the main person to weigh in, but I'll add my comments here. Correct me where I am wrong. The first thing that came to my mind when I saw the example is that Ciw only shows 'completed' records. That does not mean there are not individuals in the systems. Added to your example: >>> print(Q.nodes[1].individuals)
[[Individual 23]]
>>> print(Q.nodes[1].individuals[0][0].arrival_date)
23 To get that 24th individual to show up in the records we need to run the simulation long enough for them to actually complete their service. This would suggest that you should simulate until import ciw
import pandas as pd
N = ciw.create_network(
arrival_distributions=[ciw.dists.Deterministic(value=1)],
service_distributions=[ciw.dists.Deterministic(value=1)],
number_of_servers=[1]
)
ciw.seed(1)
Q = ciw.Simulation(N)
sim_time = 25.01
Q.simulate_until_max_time(sim_time)
recs = pd.DataFrame(Q.get_all_records())
print('Theoretical Arrivals: ', 24)
print('Actual Arrivals: ', recs.shape[0]) |
Thanks @galenseilis! Appreciate the insights here It sounds like If we look at making metrics like queue lengths, wait times, etc. it seems like it's being censored if we only consider what's in the departure node. |
Sorry for being terse, but short on time. Briefly:
I agree, this is censoring in the statistical sense of the word. What I've done to collect these incomplete records is loop over each non-exit node, and for each node loop over the individuals in that node to collect some of their attributes. You can take a look at what is in IMO having all the records, including incomplete ones, should be the default. Maybe that will get put in as a feature at some point. |
Ciw isn't necessariliy censoring unfinished individuals. It's just that a data record is only created once service is complete. The gap you are seeing is due to two reasons:
I hope that explains what you are seeing. |
thanks a ton @galenseilis and @geraintpalmer! this is helpful for me to see that the arrivals were generated, but weren't making it to the
Thanks for sharing! Had no idea about this project, excited to see what you've built here
Would love it if this was the default! I think it's helpful to make sure we're not biasing our metrics. These examples are extreme, but considering my third example, if we measure the queue length at the exit node by using And perhaps a bit pedantic, but I think the naming of the method is misleading and obscures the fact that there's censoring here (but I see here you do call out that it's just reporting on the exit node). Perhaps it can be something like |
Hello there 👋
I'm trying to understand how
ciw
handles deterministic arrivals and services in a simple queue simulation. Specifically, I'm seeing fewer arrivals than expected, and I'd appreciate some guidance on whether this is a bug or expected behavior. I am working on simulating a very simple queue where:Since arrivals are deterministic, I would like to see the theoretical number of arrivals match what
ciw
returns after the simulation, and there appears to be a discrepancy (which may totally be user error on my side 😅 ). Curious if anyone can help out and opine on some of the issues I'm having.In this dummy experiment, I would expect 24 arrivals in the system. However, I'm producing 22 arrivals in this simulation:
Interestingly, there also appears to be some influence of the number of servers. Suppose I have a the following shift schedule, which maps to EST business hours in UTC times. I intend any arrivals outside of business hours to join the waiting queue and then be serviceable in business hours. This produces 20 arrivals and similarly I would expect 24 arrivals (1 per hour for 24 hours).
Finally, if I introduce customer classes, this effect is compounded even further. Suppose I have an experiment where:
num_customer
customer classes, which each arrive at the same deterministic rate. (NB I intentionally do not want to thin these by multiplying by the proportion of the customer as documented, happy to go into more detail if needed)For this simulation, I would expect
num_customer * 24
arrivals (240 in this case), but am seeing 22 arrivals.Any advice would be really appreciated! Thanks all
The text was updated successfully, but these errors were encountered: